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Description 
Field of the mventlon 

rocOl] The present Invention relates to antbody proteins that specifically tod fibroblast «<^^^" PJ^^^^^ 
i^Pa). The invention also relates to the use of said antibodies for diagnostic and therapeutK; purposes and methods 
of producing said antibodies. 

Backgmundof the irmnUon 

[00021 The invasive growth of epithelial cancers is associated with a number of characteristic cellular and molecular 
^aes in*iWoi!ng stromTA highly consistent molecular trait of the reactive stroma of many types of eprtheha^ 
^ce^rrd^^ref^owastac^^proteina^ 

SadS^e strotral fibroblasts originally identified with monoclonal antibody F19 (Gann-Chesa P. OW L. J. a«l Rethg W. 

cSSrface glycoprcJein of reactive stromal f ibroWasls as a poterrtal antibody target in human e^^^c^ 
c^^NTZJsciZT. 7235). Since the FAP antigen is selectively expressed in the stroma of a «nge of ep^ 
Sal S^dSas independent of location and histological type, a FAP-targeting concept has been deve^ed for 
Sng S-osis^andTeatmentofepitftiaalcancers^ 

body twmed F19 that specif ically binds to FAP was developed and descrtoed in US Patent 5.059.523. which is hereoy 
20 incoroorated by reference in its entirety. . . „ u. ♦hat 

S One serious problem that arises when using non-human antibodies for »PP''<^»""^".^'^ '"^""^J^^ 
h^uicWyraiseahu^nanti-non-human response Which reduces the 

SSadministration. Humanisation of non-human antibodies is commonly adiieved in one of tv«) ways_ ^^^^^ con- 
Sucting non-humanmuman chimeric antibodies, wherein the non-human vanaUe regions are pined to human con- 
SSiJSriuHanne Q. L. Hozumi N. and Shulman. M .J. (1984) Production of functional chimaenc mousertuiman 
^SKSreS1:6S3)or2)by grafting the co^^^ 

SSrSirto ^"lan viiatle rii'ons and then Joining these "reshaped ^-^'^^'^^Zl^^^^^e 
reoiorw^^Riechmann L Clarii M.. W&Wmann H. and Winter a (1988) Reshaping human antbodies tor theraw- Nature 
S^ TaS » aiit^ies.' a«hough signHicantty better than mouse antibodi^. can Jill el«rt an gj^o^e 
^nse in humans (LoBuglio A. ?.. Wheeler R. H.. Trang J., Haynes A.. Rogers K.. Harvey E. B.. Sun L. QhrayebJ. 
^SinTI. (1989) SouseAiuman chimeric monoclonal antibody in man: Kinetics and immune response. 
S^STsd 86: 4220 - CDR-grafted or reshaped human antibodies contain little or no protein sequences thrt «n 
^JLX. as being deived fr.^ mouse antibodies. Ahhough an antibody humanised by CDR-grafbng may ^1 be 
Se to elicit some immune reactions, such as an anti-allotype or an anli-idiotypic Wnse. as ^^^"^^^'f 
human antibodies, the CDR-grafted antibody will be significantly less unmunogenic than a mouse antibody thus ena 

Ulna a mae prolonged treatment of patients. . . . «.-„^, 

FcS^l AncSher Srious limitation relating to the commercial use of antibodies for f 9"°^^"rSr?DRi^ 
heir produdbility in large amounts. In many instances recombinant expression of natve. '^►'•^"""^^^^^.^T 
aiSib^ies in cell cu Je systems is poor. Factors contributing to poor produability 
sequences and the choice of host cells tor production as well as improper folding andre^^ 
S^Mnlead topoor assembly of heavy and light chains or a transport incompetent conformaton *atfo*rfs seaet«n 
X J^h (S^TtegeSerally aSiepted. thatthe L-chain oonforsthe abilrty of secretion ithe a^^ protein. 
In some instances multiple or even single substitutions can result in the increased produrability of a>^'?s 
ttOTS] Because of the clinical importance of specific immunological targeting m wfroand /n vm> of SP^J^ 
S antigens for diagnosis and therapy in humans, there is a growing need far antibodies thrt 

properties of specific binding to FAP. low immunogenicity in humans, and hflh produaMrty in recombinant systems. 

so Disclosum of the invention 

rocOTl The tedinical problem is solved by the embodiments characterized in the claims. _ 
S The present im/ention provides new antibody proteins having the complementary f 
moTOClonal artlbody F19 (ATCC Accession No. HB 8269). said new antibody proteins specifically bindH^ * ftoroWas^ 
ss Sri^;S7AP).cJaracterisedinthattheyhaveframeworK^^^^^^^ 

in host cete as compared to a chimeric antibody having the vanaWe regions of Fl9 and foreign 
OcS AS used hTein. an "antibody protein" is a protein with the antigen binding specif icjty of a 
0010 -Complementarity determining regions of a monoclonal antibody" are understood to be those ammo acid 
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sequences involved in specific antigen binding accoiding to Kabat (Kat>at E. A.. Wu T. T. Perry H. M Gottesman K S 
and Foeller C. (1991) Sequences of Proteins of Immunological interest (5th Edn). NIH Publication No 91-3242 U S 
Department of Health and Human Sewices, Public Health Service. National Institutes of Health. Bethesda MO ) in con- 
nection with Chothia and Lesk (Chothia and Lesk, J. Mol. Biol.. JSg:S01-9l7 (1987)). 

5 BMMI] As used herein, the term -IramewrkmodificationsVefers to the exchange, deletion or addition 

muftiple amino acids in the variable regions surounding the Individual complementarity determining regiona Framework 
modrfeatons may have an impact on the Immunogenicrty. produdbility or binding specificity of an antibody protein. 
10012] Fibroblast activation protein (FAP)-. also designated fibroblast activation protein alpha (FAP«), is a mem- 

« glycoprotein belonging to the serine protease gene f^ty (WO 97/34927). No shed or secreted fbrm of 

'0 rAP IS known. 

^T?^ "^"^^^'^ by its binding to the monoclonal antibody F19 (F1 9 is obtainable from the hybridoma 

cell line with the accession No. HB 8269 deposited at the ATCC). 

.y^® • activation protein specHic binding' of an antibody protein is defined herein by its ability to 

specrfica^ recogn.se and stably bind FAP-expressing human cells. The binding specificity of the proteins of the inven- 
fton can be determined by standard methods for the evaluation of binding specificity such as described in an exemplary 
fashion in example 6. 8 and exan^e 12. ««iiHiaiy 

Z!! *T ''^"^"^ antibody- refers to an antibody protein having the light and heavy chain variable regions 
as described in figures 17 and 18 and foreign constant regions. .Foreign constant regions" as defined herein are con- 
stant regioris which are different from the constant regions of F19. For comparing an antibody protein of the Invention 
t " ^ understood that such a chimeric antibody must contain the same constant regions as 

said antibody protein. For the purpose of demonstration and comparison alone the human constant heavy and liaht 
chains as described in Figures 19 to 22 are used in an exemplary fashion. 

[0016] To proflde the antibody proteins of the present invention, the nucleic acid sequences of the heavy and light 

[0017] In one embodiment the present invention relates to antibody proteins having the conplementary determining 
re^onsof the monodonalantibodyFl9(ATCC Accession No. HB8269). said new antibody 
to fibroblast activation protein (FAP). characterized in that they have framework modifications resulting in the improved 
producabiMy in host cells as compared to a chimeric antibody having the variable regions of F19 and foreign constant 
reffons. wherein said antibody protein is derived from the murine antibody designated F19 (ATCC Accession No. HB 
8269). 

[0018] To generate humanised FAP-specific antibody proteins a chimeric antibody was constructed, having variable 
regionsof the ligW and heavy chains of F19and human Kght and heavy constant regions, respectively. The construction 
and production of chimeric mouseAiuman antibodies is well known (Boulianne et ai. (1984). referenced above) and 
35 demonstrated in an exemplary fashion in examples land 2. -wwb; ano 

[0019] Therefore. In a further embodiment the invention relates to antibody proteins according to the invention char- 
arterised in that they have a variable Ught chain region and a variable heavy chain region, each joined to a huma^ con- 
Slant region. 

[0020] In particular, the variable region of the light chain was joined to a human kappa constant region and the vari- 
able region of the heavy chain was joined to a human gamma-1 constant region. Other human constant regions for 
humanising light arid heavy chains are also available to the expert. A human kappa and a human gamma-1 constant 
reffons were.used for demonstrating the invention in an exemplary fashion only. 

CWeil Ttierefare. in one particular embodiment the antibody proteins of the invention contain a human kappa con- 
Slant region. — i-r^ 

VMOi Also, in another particular embodiment the antibody proteins of the invention contain a human gamma-1 con- 

stant region. 

One particular .chimeric F19 antibody- protein (cF19) consists of the light and heavy chain variable and con- 

so least 10 fbkJ less ttian most antibodies. 

J" '""«ase expression levels of cF19. the leader sequence of the F1 9 V.regton was changed by 

substitution of Proline to Leucine at position -9. ■o.aouuy 

EL ^i.^"?l"1^o® **** ^ sequence resulted in at least doubling tine amount of chimeric 
•intibody produced in COS cells. For the expression of this particutar chimeric antibody in COS cells the following 
mutated leader sequence of the light chain: MDSQAQVLMLLLLWVSGTCQ. and the following leader sequence of the 
heavy chain: MGWSWVFLFLLSGTAGVLS were used. sequence or me 

[0026] According to the invention ttie term "improved producibility" in host cells refers to the substantial improvement 
of expression levels andftir purified antibody yields when compared with the expression levels and/or antibody yields of 
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a Chimeric antibody wrthout framework modifications as defined abce. Tv«, particular but ""^ "Tf | 
demonstrating improved producibility are exemplified for the COS cell expresswn system (in examples 2 and 5) and for 
the CHO cell expression system (in example 10 and 11). ^. • c^a 

10027] Whllethemutationoftheleadersequenceonlyleadtothedoiilingoftheexpression^eldofU^^^ 
ITbody. a substantial improvement as defined herein refers to an improvement in expression level and/or punf rcahon 

SSaTlnTj^i^Se^Zliment. the invention refers to antibody proteins, characterised in ^^^^^^'^^ 

^incri^enJedia samples asdetermined by EUSAand/or purified antibody yields e^^^ 

purrtication yields of the chimeric antibodies without framework modifications by at least a factor <»«J0- 

rao29] In morepreferred embodiment, the invention refers to antibody proteins, characterised in ttwt their 

[S* in aude riSdia samples as determined by ELISA and/or purified aj^body yidds exceed the e^^^^^" 

and/or purification yields of the chimeric antibodies without framework modificatons by at least a factor 20 _ 

S^llnamost preferred embodiment. ar^lKXly proteins, characterised in thatth«^ 

Sis asdetermined by ELISA and/or purified artibody yields exceed the express«)n levels and/or 

of the chimeric antibodies without framework modrfications by at least a factor of 100. 

[0031 1 improved producability of the recombinant antibody proteins of the invention can be de^ionsfrated for eu^ry- 
Slls ingeneral as shown for COS (cells derived from the kidney of an African green monkey^^and CHO (Chin«e 
SnSer Iry derived ce«s) eucaryotic cells (see examples 5 and 11). In ^^^J^'^JJ^;;^^'' 
Sates to recombinant antibody proteins characterised in that they display improved P^^^^^"^^^^^^^ ^ 
[0W21 In a preferred embodiment the present invention relates to antibody proteins, wherein said eucaryotic cell is a 

'it^fu^^^y four?L certain frameworR.modKK^ns of the light *ain v-;^e regior« ^^^^^^ 
Ihe^nproved produ^lity Jthe antibody proteins of the invention. Three versions of reshaped light chain variable 
regions, designated version A. B. and C. as described in Figures 1 to 6. were prepared. 
10034] UgW chain variable region versions A. B.andCdemonstrate substantially imp^^^ 

sTiamSell). While light chain variable region versionsAandCdiffe^ 

twISmmoI amino acid residues they display an even further substantial '"^"^^^ S?^?^^^^^ 
leak anotherlOfoW difference in antibody secretfon levels between the human reshaped F19iy^ 

versions A or C. Reshaped human F1 9 light chain version A and B only differ in their amino aod se^^^^^^^'V t«to r^- 
idues at positions 36 (Tyr to Phe mutation) and 87 (Tyr to Asp mutation) (nomenclature according » *<abat). "nhis n^- 
StT^onthesecri^ycapabiUty Of amlbodiescontaining the ligW chain var^^^ 

Srect If the Tyr to Asp and Tyr to Phe mutations, considered individually or together, merely causedimproper fokl«ig 

d mfproie^ But t^is uniilLly to be the case «nce antigen binding assays show that '"--r 

F19 li^ chain version B have similar avidities to those paired with F19 light chain vers»n A or C. suggesting that they 

SSie"S^n"rthaped human F19 light chain version B seems particulariy responsible for the reduction of 

secretion vtrtien compared to versions A and C. ^ . . „ j ^ ^ i»«»,*inn 

[0036] in a preferred embodiment, the present inventfon relates to antibody proteins according to the invention, 
wherein the amino acid in Kabat position 87 of the light chain region is not asparagine. 

[00371 in a more preferred entxxliment. the invention relates to antibody proteins according to the inventon. wherein 
the ainino acid in Kabat posHion 87 of the light chain region is selected from aromatic or alphatic amino aads. 
KTn rm^^r^embodiment. thVpresent invention relates to antibody proteins accoiding to the invention, 
wherein the aromatic amino acid in Kabat position 87 of the light chain region is a tyrosine or Phenylalarana 
miS] in a further errtDodiment. the present invention also pertains to antibody proteins according to the invention. 

; wherein the aminoacid in Kabat position 36 of the fight chain region is selected from aromatc amino aads. 

IwSo] ^ 3^ en*od-|,nent the invention relates to the specif fc antibody proteins that may be prepared from 
the individually disclosed reshaped variable regions of the lightand heavy chaiiis. ™-niion 
[0C«1] Especially light chain variable region versions A and C are particularly suitable to pracbce *e ir««nton 
bSiL of Seir exceptionally high producability, while retaining full FAP-binding spoificity and ^^^Jf^rT^ 

) genWty. This hoWs^ecially true when compared to the chimeric antibody having the vanaWe regions of F19 and the 
same constant reotons but also when compared to light chain versions. 

STai T^eSe. in one embodiment the present invention relates to antibody protens that ^^^^^^^^^^ 
Son of the light chain as set forth in SEQ ID NO: 2. In a further embodiment the inventon also relates to ajbodv 
X^is-chLdeSSlinthatthevaria^^ 

' mZ] ^^n 'L embodiment the present invention relates to antibody proteins that contain the variable region of the 
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of the light chain is encoded by a nucleotide sequence as set forth in SEQ ID NO: 5. 

[0045] The present invention also discloses several different variable regions of the heavy chain that work particularly 

well with the variable regions of the light chain versions A and C in terms of Improved producability. 

[0046] In one embodiment the invention relates to antibody proteins containing a variable region of the heavy chain 

5 as set forth in any one of SEQ ID rK)s: 8, 10. 12, 14. 

[0047] In another embodiment the invention relates to antibody proteins characterised in that the variable region of 
the heavy chain is encoded by a nucleotide sequence as set forth in any one of SEQ ID NOs: 7. 9, 11, 13. 
[0048] In a very particular embodiment the invention relates to antibody proteins containing the variable region of the 
light Cham as set forth in SEQ ID NO: 2 and the variable region of the heavy chain as set forth in SEQ ID NOs: 12 

10 [0049] In a further particular embodiment the Invention relates to antibody proteins characterised in that the variable 
region of the light chain is encoded by a nucleotide sequence as set fortii In SEQ ID NO: 1 and the variable region of 
the heavy chain is encoded by a nucleotide sequence as set forth in SEQ ID NO: 1 1 , 

[0050] In a further particular embodiment the invention relates to antibody proteins containing the variable region of 
ttie light chain as set forth in SEQ ID NO: 2 and the variable region of the heavy chain as set forth in SEQ ID NOs: 8 
[0051] In a further particular embodiment the Invention relates to antibody proteins characterised in that the variable 
region of the light chain is encoded by a nucleotide sequence as set forth in SEQ ID NO: 1 and the variable region of 
tiie heavy chain is encoded by a nucleotide sequence as set forth in SEQ ID NO: 7. 

[0052] In a further aspect, the present invention relates to nucleic acid molecules containing the coding information 
for the antibody proteins according to the invention as disclosed above. Preferably, a nucleic acid molecule according 
to the present Invention is a nucleic add molecule containing a nucleotide sequence selected from SEQ ID NOs* 1 3 
5, 7,9, 11. 13. or 15. * ' ' 

[0053] A further aspect of ttie present inventfon is a recombinant DNA vector containing the nucleotide sequence of 
any one of the above-mentioned nucleic acids, e^ecially when said nucleotide sequence is operationally linked to an 
expression control sequence as in expression vectors. Preferred is a recombinant DNA vector, said vector being an 
25 expression vector. 

[0054] A further aspect of the present invention is a host cell carrying a vector as described, especially an expression 
vector. Such a host cell can be a procaryotic or eucaryotic cell. Preferably, such a host cell is a eucaryotic cell, a yeast 
cell, or a mammalian cell. More preferably, sakJ host cell Is an OHO (Chinese hamster ovary) cell or a COS cell. 
[0055] Accordingly, a still further aspect of the present invention is a method of producing antibody proteins according 
30 to the invention. Such a method comprises the steps of: 

(a) cultivating a host ceil as described above under conditions where said antibody protein Is expressed by sakJ 
host cell, and 

(b) Isolating said antibody protein. 

35 

[0056] Mamnralian host cells, preferably CHO or COS cells are preferred. Host cells for producing the antibody pro- 
teins of ttie invention may be transfected witii a single vector containing the expression units (or both, the light and the 
heavy chain. In one particular embodiment the method of producing antibody proteins according to the Invention per- 
tains to host cells, wherein sakl host cells are cotransfected with two plasmids carrying tiie expression units for the light 
40 and heavy chains respectively. 

[0057] The antibody proteins of tiie invention provide a highly specific tool for targeting therapeutic agents to the FAP 
antigen. Therefore, in a further aspect, the invention relates to antibody proteins according to ttie Invention wherein 
said antibody protein is conjugated to a ttierapeutic agent. Of ttie many ttierapeutic agents known In the art. ttierapeutic 
agents selected from the group consisting of radioisotopes, toxins, toxoicte. inflammatogenic agents, enzymes, anti- 
45 sense molecules, peptides, cytokines, and chemottierapeutic agents are preferred. 

[0058] Among ttie radioisotopes gamn^. beta and alpha-emitting radioisotypes may be i^ed as a ttierapeutic agent 
p-emitting radioisotopes are preferred as ttierapeutic radioisotopes. ^^Rhenium. ^^eRhenium. iodine and ^^ttrium 
have been proven to be particularly useful p-emitting Isotopes to achieve localized Irradiation and destruction of malig- 
nant tomor cells. Therefore, radioisotopes selected from ttie group consisting of ^^eRhenium. ^ssRhenium ^3i|ocfine 
50 and Yttrium are particulariy preferred as therapeutic agents conjugataJ to the antibody proteins of ttie invention. 
[0059] A furttier aspect of tiie present invention pertains to antibody proteins according to the invention, characterised 
in ttiat they are labeled. Such an FAP-specific labeled antibody allows for ttie focalisation and/or detection of the FAP 
antigen tn vitro and/or in vivo, A label is defined as a marker that may be directly or indirectiy detectable. An indirect 
marker is defined as a marker ttiat cannot be detected by itself but needs a further directly detectable marker specific 
for ttie indirect mariner. Preferred labels for practicing the invention are detectable markers. From ttie large variety of 
detectable markers, a detectable marker selected from ttie group consisting of enzymes, dyes, radioisotopes, and blotin 
IS most preferred. 

[OCSO] A furttier aspect of ttie present invention relates to antibody proteins according to ttie invention, characterised 
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in that they are conjugated to an imageaWe agent. A large variety o1 innageable agents, especially «««oi80topw. are 
o/eMAe from the slate o the art. For practicing the invention gamma-emitting isotopes are more preferred. Most pre- 
ferred is ^^Sn^iine. 

roosil One aspect of the present invention relates to pharmaceutical compositions containing an antib^y protein 
accading to the present invention as described above and a pharmaceutically acceptable carrier useful for treating 
tumors whereinsaidtumoreareassociated withactivatedstromalfibroblasts. There are t«m possible eHectorpnrK^ 
for an Ui-tumor stroma immunotherapy that may act synergistically: (a) An unmodified (unconjugated^ "a'^dj anti- 
body according to the invention may induce immune destruction or inflammatory reactions in the tumor stroma while (b) 
an antibody conjugated to a therapeutic agent, such as for example, a radioisotope or other toxic substance, may 
achieve localized inadiation and destruction of the malignant tumor cells. ^. ♦ 

[00S21 One futher embodiment are pharmaceutical compositions containing an antibody protein according to the 
invention conjugated to a therapeutic agent as described above and a f^'^^^^"^^'^''^'^!^^^ 
treating tumors, wherein said tumors are associated with activated stromal fibroblasts. Another embodiment perte ns to 
pharmaceutical compositions containing an antibody protein according to the present inventon conjugated to an 
imageaWeagent as described above and a pharmaceutically acceptable carrier useful for imaging the presence of actt- 
vated stromal ftoroblasts in a healing wound, inflamed skin or a tumor, in a human patient A most preferred embodi- 
ment relates to the pharmaceutical compositions mentioned above, wherein said tumors are tumors seeded from the 
cancer group consisting of cotorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovar- 
ian cancers, lung cancers, invasive bladder cancers, pancreatic cancers and cancers metastatic of the brain. 
r00S3] In an animal or human body, it can proove advantageous to apply ttie phannaceuHcal compoatons as 
described above via an Intravenous or other route. e.g. systemically, locally or topically to the tissue or organ of interest 
depending on the type and origin of the disease or problem treated. e.g. a tumor. For example, a systemic mode of 
arton is desired when different organs or organ systems are in need of treatment as m s.g. systemic airto'mwne ds- 
eases or allergies, or transplantations of foreign organs or tissues, or tumors that are diffuse or difficult to loraiise. A 
local mode of action would be considered when only local manifestations of neoplastic or immunologic acbon are 

expected, such as. tor example local tumors. 

raTO4] The antibody proteins of the present invention may be applied by different routes of application known to the 
expert notably intravenous injection or direW injekUon into target tissues. For systemic application, the intravenous, 
intravascular, intramuscular, intraarterial, intraperitoneal, oral, or intrathecal route are prefen-ed. „ . 

[0C55] A more local application can be effected subcutaneously, intracutaneously. intracardially. intralcbally. 'nframed- 
ullaity. intrapuHnonarily or directly in or near the tissue to be treated (connective-, bone-, muscle-, nerve-, epithili J tis- 
sue) Depending on the desired duration and effectiveness of the treatment pharmaceuttoal antbody compositions 
may be administered once or several times, also intemtiittently. for instance on a daily basis for sei^eral days, weeks or 
nronths and in different dosages. ^ ^ — 

[00861 For preparing suitable antibody preparations for the applications described above, ttie expert may use known 
injectable, physiologically acceptable sterHe solutions. For preparing a ready-to-use solution for parenteral injection or 
infusion, aqueous isotonic solutions, such as e.g. saline or corresponding plasmaprotein solutions are readily avajable^ 
The pharmaceutical compositions may be present as lyophylisates or dry preparations, which can be reconstiWed with 
a known injectable solution directly before use under sterile conditions, e.g. as a Wt of parts. The final preparation of *>e 
antibody conpositions of the present invention are prepared for injection, infusion or perfusion by mixing punhed anti- 
bodies according to the invention with a sterile physiotogically acceptable solution, that may be supplemented with 
known carrier substances or/and additives (e.g. serum albumine. dextirose. sodium bisulfite. EDTA). 
[00S71 The amount of the antibody applied depends on the nature of the disease. ^ ..^.^a,. 

[00881 Furthermore, one aspect of the present invention relates to the use of ttie antibody proteins accwd^to the 
invention for the treatinent of cancer. In a preferred embodiment ttie present invention relates to the use of antibody pro- 
teins according to the imrention conjugated to a therapeutic agent as described above for ttie treatinent d cancerin 
anottier preferred embodiment ttie present invention relates to the use of antibody proteins according 
conjugated to an imageaWe agent for imaging activated stromal fibioblaste. In a further P^?!^^'^^^ 
present invention relates to ttie use of labeled antibody proteins according to ttie invention for detecting the presence 
of activated stromal fibroblasts in a sample. 

[0CS91 One aspect of ttie invention relates to a mettiod of treating tumors, wherein the timor is associated with acti- 
vated stromal fibroblasts capable of specifically fanning a complex with antibody proteins according to the invention, 
present as naked/unmodHied antibodies, modified antibody proteins, such as e.g. fusion proteins, or f*bodyproteins 
conl jaated to a ttierapeutic agent, which comprises contacting the tumor witti an effective amount of said antiljodies. 
; in a preferred embodiment ttie present invention relates to a mettwd of treating tumors as mentioned above. wher«n 
tiie tumor is a tumor having cancer cells selected from the cancer group consisttng of cotorectal cancers, non-small cell 
lung cancers, breast cancers, head and neck cancer, ovarian cancers, lung cancers, invasive bladder cancere. paiwe- 
atic cancers and metastatic cancere of the brain. The method of treating tumors as described above my be effected in 
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in vitro or in vivo. 

[0070] A further aspect of the invention relates to a method of detecting the presence of activated stromal fibrotjiasts 
In wound healing, inflammation or in tumors, characterised in that 

(a) a sample, possibly containing activated stromal fibroblasts, is contacted with an antibody protein accoiding to 
the invention under conditions suitable for the formation of a complex between said antibody and antigen. 

(b) detecting the presence of said complex, thereby detecting the presence of activated stromal fibroblasts in 
wound healing, inflammation or a tumor. 

[0071] In a prefen^ed embodiment, the present invention relates to a method of detecting the presence of activated 
stromal fibroblasts in a tumor, wherein the tumor is a tumor having cancer cells selected from the cancer group consist- 
ing of colorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian cancere lung 
cancers, bladder cancers, pancreatic cancers and metastatic cancers of the brain. Most preferred antibody protons of 
the invention are those which are characterised in that they are labeled as mentioned above. 
[0072] A further aspect of the invention relates to a method of imaging the presence of activated stromal f toroblasts 
in a healing wound, inflamed skin or a tumor, in a human patient, characterised in that 

(a) an antibody protein according to the present invention conjugated to an imageable agent is administered to a 
human patient under conditions suitable for the formation of an antibody-antigen complex, 
20 (b) imaging any complex formed in this manner. 

(c) thereby imaging the presence of activated stromal fbroblasts in a human patient. 

[0073] In a preferred embodiment the present invention relates to a method of imaging the presence of activated stro- 
mal fibroblasts as described above in tumors, wherein the tumor is a tumor having cancer cells selected from the cancer 
25 group consisting of colorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian can- 
cers, lung cancers, bladder cancers, pancreatic cancers and metastatic cancers of the brain. 
[0074] In a further aspect the present Invention relates to a method of detecting tuma-stroma. characterised in that 

(a) a suitable sample is contacted with an antibody protein according to the present invention, under conditions suit- 
so able for the formation of an antibody-antigen complex, 

(b) detecting the presence of any complex so formed, 

/ (c) relating the presence of said complex to the presence of tumor-^roma. 

[0075] Antibody proteins for practicing the invention are preferably labelled with a detectable marker. 
[0076] In a further aspect the present invention relates to a method of imaging tumor-stroma in a human patient which 
comprises 

(a) adminstering to the patient an antibody according to the invention conjugated to an imageable agent as 
desaibed above under conditions suitable for the formation of an antibody-antigen complex. 

(b) imaging any complex so formed, and thereby imaging the presence of tumor-stroma in a human patient. 

Figure legends 

mm 

Fig. 1. DNA sequence of F19 human reshaped light chain variable region version A (hFISL^) SEQ ID NO:t 

Fig. 2. Amino acid sequence ofF19 human reshaped light chain variable region version A (hFigi^) SEQ ID NO: 2. 

Fig. 3. DNA sequence of F19 human reshaped light chain variable region version B (hFWLo) SEQ ID NO- 3 
Nucleotides differing from version A are underlined and in bold type. 

Fig. 4. Amino acid sequence of F19 human reshaped tight chain variable region version B (hFigLe) SEQ ID NO: 
4. Amino acids differing from version A are underlined and in bold type. 

Fig. 5. DNA sequence of F19 human reshaped light chain variable region version 0 (hF19Ln) SEQ ID N0:5 
Nucleotides differing from verston A are underlined and in bold type. 
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Fig. 6. Amino acid sequence of F19 human reshaped light chain variable region version C (hF19Lc) SEQ ID NO: 
6. Amino acids differing from version A are underlined and in bold type. 

Fig. 7. DMA sequence of F19 human reshaped variable region heavy chain version A (hF19H/0 SEQ ID NO: 7. 

Fig. a. Amino acid sequence of F19 human resheped heavy chain variatle region versbn A (hFWH/O SEQ ID NO: 
8 

Fig. 9. DNA sequence of F19 human reshaped heavy chain variable region versbn B (hFigHg) SEQ ID NO: 9, 
Nucleotides differing from version A are underlined and in bold type. 

Fig. iO. Amino acid sequence ofF19 human reshaped heavy chain variable region version B (hFigHg) SEQ ID 
NO: 10. Amino ackJs differing from verston A are underlined and in bold type. 

Fig. 11. DNA sequence of F19 tiuman reshaped heavy chain variable region version C (hF19Hc) SEQ ID NO: 1 1 . 
Nucleotides differing from version A are underlined and in bold type. 

Fig. 12. Amino acid sequence of F19 human reshaped heavy chain variable region version C (hF19Hc) SEQ ID 
NO: 12, Amino acids differing from version A are underiined and in bold type. 

Fig. 13. DNA sequence of F19 human reshaped heavy chain variable region version D (hFigH^j SEQ ID NO: 13. 
Nucleotides differing from version A are underlined and in bold type. 

Rg, 14. Amino acid sequence of F19 human reshaped heavy chain variable region versbn D (hFWHo) SEQ ID 
NO: 14, Amino acids differing from version A are underlined and in bold type. 

Fig. 15. DNA sequence of F19 human reshaped heavy chain variable regbn version E (hFWHE) SEQ ID NO: 15. 
Nucleotides differing from version A are underiined and in bold type. 

Fig. 16. Amino acid sequence of F19 human reshaped heavy chain variable region version E (hFWHE) SEQ ID 
NO: 16. Amino acids differing from version A are underiined and in bold type 

Fig. 17. Amino acid sequence of F19 chimeric light chain variable region (chF19LC) SEQ ID NO: 17. 

Fig. 18. Amino acid sequence of F19 chimeric heavy chain variable region (chFWHC) SEQ ID NO: 18, 

Fig. 19. DNA sequence of human l^appa light constant chain SEQ ID NO: 19. 

Fig. 20. Amino acid sequence of human light constant chain SEQ ID NO: 20. 

Fig. 21. DNA sequence of human heavy constant chain SEQ ID NO: 21. 

Fig. 22. Amino ackj sequence of human heavy constant chain SEQ ID NO: 22, 

Fig. 22. Mammalian cell expression vectors used to produce chimeric and reshaped human antibodies v^ritfi human 
kappa light chains and human gamma-1 heavy chains, 

A. Light chain expression vector: pKNIOO 

B. Heavy chain expression vecta: pQ1D105 

Fig 24. DNA and amino acid sequences of mouse F19 light chain variable region as modified for use in the con- 
struction of chimeric F19 light chain. Restriction sites are indicated by bold letters. The Kozak sequence. CDR's 1 
to 3 and the splice donor site are underlined. 

Fig 25. DNA and amino acid sequences of mouse F19 heavy chain variable region as modified for use in the con- 
stmction of chimerb F19 heavy chain. Restriction sites are indicated by bold letters. The Kozak sequence and the 
splice donor site are underlined. 
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Fig. 26. DNA sequence of F19 chimeric antibody cloned into pKNWO mammalian expression vector. Restriction 
sites are indicated by bold letters and underlined. CDR's 1 to 3 and the splice donor site are underlined. This is the 
DNA sequence of the mouse F19 light chain inside the pKNIOO eukaryotic expression vector. This vector has a 
cDNA version of the human kappa constant region gene (allotype Km(3)) terminated by a strong artificial termina- 
5 tion sequence. In addition, the Neo selection gene is also temiinated by this artificial sequence and is also in the 
same orientation as the kappa light chain expression cassette. 

The essential components of the pKNIOO eukaryotic expression vector are: 





1 -6 


= EcoRI site 


10 


7-1571 


= HCMVi promoter/enhancer 




583-587 


= TATAAbox 




610 


= Start of transcription 




728 - 736 


= Splice donor site 




731 


= Beginning of intron 


IS 


1557 


= ErxJ of intron 




1544- 1558 


= Splice acceptor site 




1590-1598 


= Kbzak sequence 




1599-1658 


s peptide leader sequence 




1659- 1997 


= mouse F1 9 light chain 


20 


1996 - 2004 


~ splice donor site 




2011 -2657 


= cDNA copy of human Kappa constant region (Km(3)) gene 




2664 - 2880 


= Artificial spaC2 termination sequence 




2887 - 7845 


= This is the pSV2neo vector DNA fragment comprising of the Amp-resistance gene (in the oppo- 
site aientation), the CdEI and SV40 origins of replication and the Neo-resistance gene (in the 


25 




same orientation as the HCMVi-KCT cassette) 




7852 - 8068 


= Artificial spaC2 termination signal 



This sequence ends immediately upstream of the EcoRI site (position l -6) at the beginning of the sequence. 
As a vector this DNA sequence would be circular. 

Fig. 27. DNA sequence of F19 chimeric antibody cloned into pgldlOS mammalian expression vector Resti-iction 
sites are indicated by bold letters and underlined. CDR*s 1 to 3 and the splice donor site are underlined. This is tfie 
DNA sequence of the eukaryotic expression vector pQlD105 containing the mouse F19 heavy chain variable 
region. This vector contains a cDNA version of the human gamma-1 oonslanl region (allotype Glm*^®^*). 
The essential components of the construct are: 



1 • 2501 = pBR322 based sequence inducfing Ampicillin resistance gene and ColEI origin plus the SV40 ori- 
gin and the crippled SV40 earty promoter 

2502 - 3226 » dhf r gene 

40 3233 - 4073 = SV40 pdy A sequence etc. 

4074 - 4079 = llgated BamHI and Bglll site (BstYl) 

4080 - 4302 s SPA site plus C2 termination signal 

4303 - 5867 » HCMVi promoter 

5879 - 5885 = unique Hindlll restriction site for cloning of immunoglobulin variable genes 

45 5886 - 5894 = Kbzak sequence 

5895 - 5951 = signal peptide 

5952 - 6323 = mouse F19 heavy chain 

6323 - 6330 = splice donor site 

6331 - 6336 = unique BamHI restriction site for cloning of Immunoglobulin variable genes 

so 6337 - 7388 = cDNA copy of hunnan gamma-1 constant regions preceded by a 62 bp intron 

7389 - 7709 s Arnie termination sequence 



The human gamma-l constant region used in ttiis construct has a Glm'^^'^ allotype which is defined by a 
Glutamic add (E) residue at position 356 (according to Eu numbering) and a Methionine (M) residue at position 358 
(according to Eu numbering). These two residues are underlined in tiie sequence above. 

Fig. 28. PCR-based method for the constmction of human reshaped F19 light chain. This figure provides a sche- 
matic overview of ttie strategy of construction. The dotted lines indicate a complementary sequence of at least 21 
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bases between the primers. 

Ffg. 29. Nucleotide and deduced amino acid sequences of reshaped human F19 light chain variable regions ver- 
sion A S and C Nucleotide and deduced amino acid sequences are aligned and compared with that of version A, 
dashes indicate nucleotide identity, dots indicate amino acid identity with this sequence. Amino acids are numbered 
according to Kabat et ai (1991). The locations of CDRs are indicated in boxes. 

Fig. 30. DMA sequence ofF19L^ (human reshaped light chain version A) cloned into pKNIOO mammalian expres- 
sion vector. Restriction sites are indicated by bold letters and underlined. CDR's 1 to 3 and the splice donor site 
are underlined. This is the DNA sequence of the reshaped F19 light chain version A cloned into pKNIOO eukaryotic 
expression vector. This vector has a cDNA version of the human kappa constant region gene (allotype Km(3)) ter- 
minated by a strong artificial termination sequence. In addition, the Neo selection gene is also tenninated by this 
artif icial sequence and is also in the same orientation as the Kappa light chain expression cassette. 
The components of the vector are: 

7-1571 = HCMVi promoter/enhancer . 

583-587 =TATAAbox. 

610 = Start of transcription. 

728 - 736 - Splice donor site. 

731 = Beginning of intron. 

1557 s End of intron. 

1 544 - 1 558 = Splice acceptor site. 

1 590 - 1 598 = Kozak sequence 

1 599 - 1 658 = peptide leader sequence 

1659 - 1997 = reshaped F19 light chain version A 

1 996 - 2004 = splice donor site 

201 1 - 2657 = cDNA copy of human kappa constant region (Km(3)) gene. 

2664 - 2880 = Artificial spaC2 termination sequence. 

2887 - 7845 = This Is the pSV2neo vector DNA fragment conprising of the Amp-resistance gene (in the oppo- 
site aientation), the ColEI and SV40 origins of replication and the Neo-resistance gene (in the 
same orientation as the HCMVi-KCT cassette). 

7852 - 8068 = Artificial spaC2 termination signal. 

This sequence ends immediately upstream of the EcoRI site (position 1-6) at the beginning of the sequence 
below. As a vector this DNA sequence would be circular. 

Fig. 31. PCR-based method for the construction of human reshaped F19 heavy chain. This figure provides a sche- 
matic overview of the strategy of construction. The dotted lines indicate a complementary sequence of at least 21 
bases between the primers. 

Fig. 32. Nucleotide and deduced amino acid sequences of reshaped human F19 heavy chain variable region ver- 
sions a toe. Nucleotide and deduced amino acid sequences are aligned and compared with that of verston A. 
dashes indicate nudeotkle identity, dots indicate amino ackj identity with this sequence. Amino adds are numbered 
according to Kabat et al. (1991). The location of CDRs is indk»ted by b03^. 

Fig. 33. DNA sequence of F19Ha (human reshaped heavy chain versbn a) cloned into pgldlOS mammalian 
expression vector. Restriction sites are indicated by bold letters and underiined. CDR's 1 to 3 and the splfce donor 
site are underlined. This is the DNA sequence of the eukaryotic expression vector pGIDIOS containing the 
reshaped version A of F1 9 heavy chain variable region. This vector contains a cDNA version of the human gamma- 
1 constant region (allotype Glm'^^"-^. 

The essential components of the construct are: 

1 -2501 = pBR322 based sequence induding Ampicillin resistance gene and CdEI origin plus the SV40 ori- 
gin and the crippled SV40 early promoter 
2502 - 3226 ^ dhfr gene 
3233 - 4073 = SV40 pdy A sequence etc. 
4080 - 4302 = SPA site plus C2 termination signal 
4303 - 5867 = HCMVi promoter/enhancer 
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5879 - 5885 =: unique Hindlll restriction site for dcning of immunoglobulin variable genes 

5886 - 5894 = Kbzak sequence 

5895 - 5951 » signal peptide 

5952 - 6323 = reshaped F1 9 heavy chain version A . 

5 6323 - 6330 = splice donor site 

6331 - 6336 = unique BamHI restriction site for cloning of immunoglobulin variable genes 

6337 - 7388 = cDNA copy of human gamma-1 constant regions preceded by a 62 bp intron 

7389 - 7709 = Amie termination sequence 

10 The human gamma-1 constant region used in this construct has a Glm^*'""* allotype which is defined by a 

Glutamic add (E) residue at position 356 (according to Eu numbering) and a Methionine (M) residue at position 358 
(according to Eu numbering). These two residues are underlined in the sequence above. 

Fig. 34. Heavy (panel A) and light (panel B) chains RNA splicing e\/ents taking place during anybody F19 expres- 
15 sion in mammalian cells - schematic overview. 

A. Heavy chain RNA splidng 

a Kappa light chain RNA splidng 

20 

Fig. 35. Concentration dependence of L/^Hq supernatant binding to CDS-FAR 
Fig. 36. Binding of bbtinylated L^Hc to human FAR 
25 Rg. 37. CDd'FAP carries the F19 epitope as detected with cF19. 
Examples 

Example 1: Construction of mouse - human chimeric genes 

. 30 

[0078] The chimeric F19 (cF19) antibody was designed to have the mouse F19 V|_ and Vh regions linked to human 
kappa and gamma-1 constant regions, respectively. PGR primers w&re used to modify the 5'- and 3'- sequences flank- 
ing the cDNA sequences coding for the mouse F19 Vl and Vh regions (Tat>le 1). PGR primers specific for F19 light 
chain V-region were designed. These adapted mouse F19 variable regions were then subcloned into mammalian cell 
35 expression vectors already containing the human kappa (pKNIOO vector) or ganvna-l (pG1D105 vector) constant 
regions (Figure 23). 

[Q079] These vectors employ the human cytomegalovirus (HCMV) promoter/enhancer to efficiently transcribe the light 
and heavy chains. The vectors also contain the SV40 origin of replication to permit efficient DNA replication and sub- 
sequent protein expression in cos cells. The expression vectors were designed to have the variaisle regions inserted as 

40 Hindlll-BamHI DNA fragments. PGR primers were designed to introduce these restrictions sites at the 5'- (Hindlll) and 
3'- (BamHI) ends of the cDNAs coding for the V-regions. In additton the PGR primers were designed to introduce the 
Kozak sequence (GGGGGGAGG) at the 5'-ends of both the light and heavy chain cDNAs to allow effident translation 
(Kbzak M.: At least six nudeotides preceding the AUG initiator codon enhance translation in mammalian cells. 1 Mol. 
Biol. (1987) 196: 947). and to introduce splice donor sites at the 3'-ends of both the light and heavy chain cDNAs for 

45 the variable regions to be spliced to the constant regions. The PGR primers used in the construction of the chimeric F1 9 
light and heavy chains are shown in Table 1 . The DNA and amino acid sequences of the mouse F19 and Vh regions 
as adapted for use in the construction of chimeric F1 9 light and heavy chains are shown in Figures 24 and 25. The DNA 
sequences of mouse F19 light and heavy chains cloned into the eukaryotic expression vectors pKNIOO and pG1D105. 
respectively, are shown in Figures 26 and 27. 

50 
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TABLE 1 : PGR primers for the construction of chimeric F1 9 antibody. 

A Li ght chain variable region 

1 . Primer for the constmclion of the 5'-end (37mer) 

5' CAGA AA6CTT GCCGCCACC ATG GAT TCA CAG GCC CAG 3' 
Hindlll Kozak seouence M D S Q A Q 

2. Primer for the construction of the 3'-end (35mer) 

5' CCGA GGATCC ACTCACGTTT CAG CTC CAG CTT GGT 3' 
BamHI Solice donor site 

B. Heavy chain variable reoion 

1 . Primer for the construction of the 5'-end (37mer) 
S CAGA AAGCTT GCCGCCACC ATG GGA TGG AGC TGG GTC 3* 
Hindlll Kozak seouence M G W S W V 



2. Primer for the construction of the 3*-end (35mer) 
S CCGA GGATCC ACTCACC T GA GGA GAC GGT GAC TGA 3' 
BamHI Solice donor site 



Example 2: Expression and binding activity of ctiimeric F19 antibody 

[0080] The two plasmid DMAs coding for the chimeric F1 9 light and heavy chains (see exanple 1 ) were co-transfected 
into cos cells to look tor transient expression ot chimeric F19 antibody as described below. After 72 h inciAwtion. the 
medium was collected, centrlfuged to remove cellular debris, and analysed by ELISA for the production of a human 
IgGMike antibody. The cos cell supernatant containing the chimeric F1 9 antibody was analysed for its ability to bind to 
HT 1080 cells (see example 13) expressing the FAP antigen on their surface. 

Transfection of cos cells us ing electroDoratlon 

[0081 ] The mammalian expression vectors pg1d105 and pKN1 00 containing the chimeric or reshaped human heavy 
and light chains versions, respectively, were tested in cos cells to look for transient expresston of F19 antibodies. Cos 
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7 cells were passaged routinely in DMEM (GIbco BRL cat #41966) containing penicillin (50 lU/mQ, streptomycin 
(50jig/ml). L-glLrtamine and 10% heat-inactivated gamnia globulin-free foetal calf serum (FCS, Harlan Sera-Lab cat. # 
D0001), The DNA was introduced into the cos cells by electroporation using the Gene Pulsar apparatus (BioRad). DNA 
(10|ig of each vector) was added to a 0.8ml aliquot of 1x10^ cella/ml in Phosphate-buffered saline (PBS. Ca^* and Mg2+ 
5 free). A pulse was delivered at 1 .900 volts. 25jiF capacitance. After a 10 min recovery period at ambient temperature 
the electroporated cells were added to 8 ml of DMEM containing 5% FCS. After 72h incubation at 37«C, the medium 
was collected, centrifuged to remove cellular debris, and stored under sterile conditions at 4*»C for short periods of time, 
or at -20^0 for longer periods. 

EllSA me thod for measuring assemt^ed laGiykaopa antibodv concentrations in ens cell supernatants 

[0082] Samples of antibodies produced in transfected cos cells were assayed by ELISA to determine how much 
reshaped human antibody had been produced. For the detection of human antibody, plates were coated with goat anti- 
human IgG (Fcir fragment specific) antibody (Jackson ImmunoResearch Laboratories Inc.. #109-005-098). The sam- 

15 pies from cos cells were serially diluted and added to each well. After incubation for 1 h at 37'^C and washing, horserad- 
ish peroxidase conjugated goat anti-human kappa light chain (Sigma. A-7164) was added. After incubation for 30 mins 
at 37*^0 and washing. K-blue substrate (mixer of 3.3\5,5* tetramethylbenzidine and hydrogen peroxide. Bionostics Um- 
ited. #KB175) was added. After standing at room temperature for 30 mins. the reaction was stopped using Red Stop 
solution (Bionostics Limited. #RS20) and the optical density read on a microplate reader at 650 nm. Purified human 

20 IgGl/Kappa antibody (Sigma, 1-3889) of known concentration was used as a standard. 

[0083] The expression of chimeric F1 9 anttoody in COS cells was poor (Table 2), between 10 and 60 ng/ml which is 
at least 10 fold less than most antibodies. 

[0084] In an attempt to increase expression levels of the chimeric F1 9 antibody, the leader sequence of F1 9 V|_ region 
was changed by substitution of Leucine to Proline at position -9. This single change in amino acid in the leader 
25 sequence resulted in at least doubling the amount of chimeric antibody produced in COS cells. 

[0085] The test results show that chimeric F19 binds specifically and with the expected avidity to the FAP target. 



TABLE 2 



Chimeric F1 9 antibody concentrations In COS ceil supernatants 
(These are the lesults off three independent transf^ions) 


Transfected Antibody corrponents 


Human y1/K 


Heavy chain 


Kappa light chain 


[in ^g/ml] 


cF19 


cF19 (F19 leader sequence) 


0.060 


cF19 


cF19 (mutated leader sequence) 


0.212 


cF19 


cF19 (F19 leader sequence) 


0.056 


cF19 


cF19 (mutated leader sequence) 


0.108 


cF19 


cF19 (F19 leader sequence) 


0.011 


CF19 


cFl9 (mutated leader sequence) 


0.087 



45 

Example 3: Construction of the reshaped human F19 light chain versions a to c (La-Lb) 

[0086] The construction of the first version of reshaped human F19 V|_region (La) was earned out using overlapping 
PGR fragments in a method similar to ttiat descrflDed by Daugherty B. L, DeMartino J. A., Law M. F. Kawka D. W., 

50 Singer 1. 1, and Mark G. E. (1991) Polymerase chain reaction facilitates the cloning, CDR-grafting, and rapid expression 
of a murine monoclonal antibody directed against the CD18 component of leukocyte integrins. Nucl. Adds Res. 19: 
2471 Ten oligonucleotides were syrthesised ttiat consisted of five primer pairs. APCR1 -vial , vla2-vla3. vla4-vla5, vla6- 
vla7. and vla8-APCR4 (Table 3 and Figure 28). There was an overlapping sequence of at least 21 bases between adja- 
cent pairs (Figure 28). APCR1 and APCR4 hybridised to the flanking pUC1 9 vector sequences. The mutagenic primers 

55 were designed such that their 5' end immediately followed the wobble position of a codon. This strategy was used to 
counteract tiie gratuitous addition of one nucleotide to the 3' end of the strand complementary to the mutagenic primer 
by the DNA polymerase during PGR (Shan-ocks A. D. and Shaw R E. (1992) Improved primer design for PCR-based. 
site-directed mutagenesis. Nud. Adds Res, 20: 1147). The appropriate primer pairs (0.2jiM of each) were combined 
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with lOng of version l)" of reshaped human L25Vl region cDNA. and 1 unit of AmpliTaq (PerWn Elmer Cetus) DNA 
polymerase in 50^l of PGR buffer containing 10mM Tris-HCI (pH8.3). 50mM KCI, 200^f^ dNTPs. and 1.5mM MgClg. 
This was overlaid with mineral oil md PGR was perfamed for 25 cycles, each cycle consisting of a denaturation step 
at 94'>C for 1 min. a primer annealing step at 55«G for 1 min. and an extension step at 72«C for 2 mins. This was followed 

5 by a single cycle consisting of a further elongation step at 72*»C for 1 0 mins followed by cooling to 4»C. The ramp time 
between the primer-annealing and extension steps was 2.5 mins. The PGR products of the five reactions (A. B. G, D 
and E) were then purified by gel electrophoresis followed by DNA elution using Wizard PGR preps (Promega). PGR 
products A, a D, and E were assembled by their complementarity to one another. In the second set of PGR reac- 
tions. PGR products B and G. and D and E, (50ng of each) were added to 50^1 PGR reactions (as described above) 

10 each containing 1 unit of AmpliTaq (PerWn Elmer Getus) DNA polymerase. The reactions were cycled for 20 cycles as 
described above with the exception that the annealing temperature was raised to 60^G. In the third set of PGR reac- 
ttons, PGR products F and G were PGR-ampllfied using 1 ^1 of each prior PGR reaction and the appropriate pair of PGR 
primers (vla2-vla5 or vla6-APGR4). The PGR reactions contained 1 unit of AmpliTaq DNA polymerase in 50 ^il PGR 
reaction (as described above) and w^e amplified tor 25 cycles as in the first stage. In the fourth set of PGR reactions. 

,5 the PGR product H was PCR-amplHied using 1 pi of each prior PGR reaction and the vla2-APGR4 pair of PGR primers. 
Finally. PGR products A and H were assembled by their own complementarity in a two step-PGR reaction similar to that 
desaibed above using RSP and UP as the terminal primers. The fully assembled fragment representing the entire 
reshaped human F19 V^region including a leader sequence was digested with Hindlll and BamHI and cloned into 
pUG1 9 for sequencing. A clone having the correct DNA sequence was designated reshFI 9La (Figure 29) and w^ then 

20 subdoned into the eukaryotic expression vector pKNIOO. The DNA sequence of reshF19U cloned into pKNIOO is 
shown in Figure 30. 

[0087] The second version of reshaped human F19 Vtregion (Lb) was constructed using the same scheme as that 
described for U but where vla4 and vla7 primers were substituted by vlb4 and vlb7 respectively (Table 3). The DNA 
sequence of Lb is shown in Figure 29. 

25 [0088] The third version of reshaped human F19 Vjegion (Lc) was constructed using the QuikChange site-directed 
mutagenesis kit from Stratagene. The QuikChange site-directed mutagenesis method was performed according to the 
manufacturer s instructions, using reshF19U in pKNIOO vector as double stranded DNA template. The mutagenic oli- 
gonucleotide primers Fl9Lc-sense and F19Lc-antisense (Table 3) for use in this protocol were designed according to 
the manufacturers instructions. Briefly, both the mutagenic primers contained the desired point mutation (codon TTT at 

30 Kabat residue position 49 (Phe) changed to TAT coding for Tyr) and annealed to the same sequence on opposite 
strands of U in pKNIOO vector. The point mutation was verified by DNA sequencing the entire Vl region. The DNA 
sequence of U is shown in Figure 29. To eliminate the possibility that random mutations occunred in the pKNIOO during 
the PGR reaction, the Vtregion was cut out of the pKNIOO vector as an HindllimamHI fragment and re-subcfoned into 
an unmodified pKNIOO vector cut with the same two restriction enzymes beforehand. 
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TABLE 3: PCR primers for the construction of reshaped human F19 light chain 
variaiile regions 

1 . Primers for the synthesis of version "a" 

F19vla1 (36mer): 

5' GTCATCACAATGTCTCCGGAGGAACCTGGAACCCAG 3' 
F19vla2 (29 mer): 

5* CTCCGGAGACATTGTGATGACCCAATCTC 3" 
F19vla3 (45 mer): 

5* GAATATAAAAGGCTCTGACTGGACTTGCAGTTGATGGTGGCCCTC 3' 
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F19vla4 (72 mer): 

5'CAGTCAGAGCCTTTTATATTCTAGAAATCAAAAGAACTACTTGGCCTGGTAT 
CAGCAGAAACCAGGACAGCC 3' 

F19vla5 (44 mer): 

5* ACCCCAGATTCCCTA6TGCTAGCCCAAAAGATGAGGAGTTTGGG 3* 
F19vla6 (67 mer): 

S'TAGCACTAGGGAATCTGGGGTACCTGATAGGTTCAGTGGCAGTGGGTTTG 
GGACAGACTTCACCCTC 3' 

F19vla7 (53 mer): 

5'GTCCCTTGTCCGAACGTGAGCGGATAGCTAAAATATTGCTGACAGTAA 
TAAAC3' 

F19vla8(33mer): 

5' GCTCACGTTCGGACAAGGGACCAAGGTGGAAAT 3' 

2. Primers for the synthesis of version "b" 
F19vlb4 (72 mer): 

5'CAGTCAGAGCCTTTTATATTCTAGAAATCAAAAGAACTACTTGGCCTGG 
TTCCAGCAGAAACCAGGACAGCC 3' 

F19vlb7(57mer): 

6'GTCCCTTGTCCGAACGTGAGCGGATAGCTAAAATATTGCTGACAGTCATA 
AACTGCC 3- 

3. Primers for the synthesis of version 'c' 
F19Lc-sense (34 mer): 

5' CCCAAACTCCTCATCTATTGGGCTAGCACTAGGG 3' 
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F19Lc-anti8ense (34 mer): 

5' CCCTAGTGCTAGCCCAATAGATGAGGAGTTTGGG 3* 

4. Primers hybridizing to the flanking PUC1 9 vector sequences 

APCR1 (17 mer, sense primer): 5' TACGCAAACCGCCTCTC 3' 

APCR4 (18 mer. anti-sense primer): 5* GAGTGCACCATATGCGGT 3 

RSP (-24) (16 mer. sense primer): 5* AACAGCTATGACCATG 3' 

UP (-40) (1 7 mer. anti-sense primer): 5' GTTTTCCCAGTCACGAC 3' 



Example 4: Construction of the reshaped huntan F19 heavy chain versions a to e (Ha-He) 



[0089] Version "a" of reshaped human F19 Vh regions (Ha) was constructed using the same PGR methods as 
described for the construction of version "a" of reshaped human F1 9 V^region (La) (Figure 31). The template DMA was 
version "a" of reshaped human 226 Vh (L6ger O. J. P.. Yednock T. A.. Tanner L, Horner H. C, Hines D. K., Keen S., 
SaWanha J., Jones T. Fritz L C. and Bendig M. M. (1997). Humanization of a mouse antibody against human alpha-4 

30 integrin: a potential therapeutic for the treatment of multiple sclerosis. Hum. Antibod. 8: 3). Six PGR primers were 
designed and synthesized for the construction of version "a" of reshaped human F19 Vh region (Table 4). PCRproducts 
A, B, G. and D were obtained using APGRI-Vhal. Vha2-Vha3. Vha4-Vha5 and Vha6-APGR4 as PGR primer pairs, 
respectively. The PGR conditions were essentially as described for the construction of reshaped human PI 9 V^^region. 
A clone having the conrect DNA sequence was designated reshF19Ha (Rgure 32) and was then subcloned into the 

35 eukaryotic expression vector pGI D105. The DNA sequence of reshF19Ha cloned into pGIDIOS is shown in Figure 33. 
[0090] The third version of reshaped human F19 Vh region (He) was constructed using the same scheme as that 
described for Ha but where Vha4 primer was substituted by Vhc4 (Table 4). The DNA sequence of He is shown in Figure 
32. The second (Hb) and fourth (Hd) version of reshaped human F19 V^ region were constructed based on the PCR- 
mutagenesis methods of Kamman et al. (Kamman M.. l^ufs J., Schell J. and Gronenborn B. (1989) Rapid insertional 

40 mutagenesis of DNA by polymerase chain reaction (PGR), Nucl. Acids Res. 17: 5404). For Hb and Hd. a mutagenic 
primer F19VHbd6 (Tyr-91 to Phe-91 , Table 4) was used paired with APCR4 in PGR reactions with Ha and He as the 
template DNA. respectively. The PGR products VHb and VHd were restriction enzyme digested with PstI and BamHI 
and subcloned into reshF19Ha and reshF19Hc. respectively, previously digested with the same two restriction 
enzymes. The DNA sequences of Hb and Hd are shown in Figure 32. 

45 [0091 ] Version e of reshaped human F1 9 Vh region (He) was constructed based on the PGR-mutagenesis methods 
of Kamman et al. (1 989) already mentioned above: 

[0092] For reshF19He mutagenic primer F19MsclHe (Table 5) was used paired with primer F19VHHindHI (Table 5) in 
PGR reactions with He cloned in pgldlOS mammalian expression vector as the template DNA. The appropriate primer 
pairs {0.2\iM of each) were combined with 10ng of cDNA of version "a" of reshaped human 226 Vh region in lOOpi of 

50 PGR buffer containing lOmM KGI, lOmM (NH4)2S04, 20mM Tris-HGI (pH 8.8) 2mM IV!gS04, 0.1% Triton X-100 and 
200^ dNTPs. Reaction mixtures were overlaid with mineral oil and Kept at 94*G for 5 mine. Then 1 unit of Deep Vent 
DNA polymerase (New England Biolabs) was added ("Hot Start" PGR; Ghou Q., Russell M.. Birch D., Raymond J. and 
Bloch W. (1992) Prevention of pre-PGR mis-priming and primer dimerization improves low-copy-number amplifications. 
Nucl. Acids Res. 20: 1717) and PGR was performed for 25 cycles on a TRIO-Thermoblock Thermal Gycler (Biometra. 

55 GOttingen, Germany). Each cycle consisting of a denaturation step at 94*'G for 1 min, a primer annealing step at 70**G 
for 1 min. and an extension step at 72''G for 2 mins. This was followed by a single cycle consisting of a further elongation 
step at 72**G for 10 mitts followed by cooling at 4*»G. The PGR products were then extracted and purified from a TAE 
1.4% standard agarose gel using a QIAquick^" gel extraction kit. following the protocol supplied by the manufacturer 
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(QIAGEN Ltd.. UK). The PGR product W^e was then restriction enzyme digested with MscI and Hindlil and ligated into 
reshF19Hc cloned in pg1d105 previously digested with the same two restriction enzymes. The MscI restriction recog- 
nition site l8 unique to all the reshaped human F19 Vh region versions and is not present In the pg1d105 expression 
vector. The Hindlil restriction recognition site is a unique site in pg1d105 for clotting of Vh immunoglobulin genes. 

5 [0093] Electroporation-competent XL-1 Blue E. coli cells were transformed with 1 ^1 of the ligated DNA and plated on 
agarose plates containing Ampicillin. Colonies were then screened for the presence and correct size of inserts by direct 
PGR on colonies (GOssow D. and Clackson T (1989) Direct done characterization from plaques and colonies by the 
polymerase chain reaction. Nucl. Acids Res. 17: 4000) with primers HCMi and Huoyl hybridising to the flanking 
pg1d105 vector sequences (Table 5), DNA from positive colonies was prepared using a Plasmid Midi Wt, following the 

TO protocol supplied by the manufacturer (QIAGEN Ltd., UK). DNA sequendng was performed by the dideoxy chain termi- 
nation method (Sanger P.. Nidden S. and Goulson A, (1977) DNA sequendng with chain-terminating inhibitors. Proc. 
natn. Acad. ScL U S. A. 74: 5463) directly from drcular vector DNA using conventional heat denaturation (Andersen 
A.. Pettersson A. and KiekJsen T (1992) A fast and simple technique for sequendng plasmid DNA with sequenase 
using heat denaturation. Biotechniques 13: 678 ) and Sequenase 2.0 (USB. Cleveland. OH). The DNA sequences of 

75 reshPI 9He is shown in Figure 32. 

TABLE 4: PGR primers for the construction of reshaped human F19 heavy 
chain variable regions versions a to d. 

20 

1 Primers for the synthesis of version "a" 

25 

F19vha1 (47mer): 

5' GTGTATTCAGTGAAGGTGTATCTACTAGTTTTACAGCTGACTTTCAC 3' 

30 

F19vha2 (53mer): 

5' TAGTAGATACACCTTCACT6AATACACCATACACTGGGTTAGACAGG 
CCCCTGy 
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F19vha3 (71 mer): 

5' CCCTTGAACTTCTGGTTGTAGTTAGGAATACCATTGTTAGGATTAATACC 
TCCTATCCACTCCAGCCTTTG 3' 

F19vha4 (71 mer): 

5'TAACTACAACCAGAAGTTCAAGGGCCGGGCCACCTTGACCGTAGGCAA 
GTCTGCCAGCACCGCCTACATGG 3' 

F19vha5(63mer): 

5'GCATGGCCCTCGTCGTAACCATAGGCGATTCTTCTTCTGGCGCAGTAGT 
AGACTGCAGTGTCC 3' 

F19vha6(48 mer): 

5' CTATGGTTACGACGAGGGCCATGCTATGGACTACTGGGGTCAAGGAAC 3' 

2. Primers for the synthesis of version "c" 
F19vhc4(71 mer): 

5TAACTACAACCAGAAGTTCAAGGGCCGGGTCACCATCACCGTAGACA 
CCTCTGCCAGCACCGCCTACATGG 3* 

3. Primers for the synthesis of version "b" and "d" 
F19vhbd6(27mer): 

5' GGACACT6CAGTCTACTTCTGCGCCAG 3' 



4. Primers hybridizing to the flanking PUC19 vector sequences 

APCR1 ( 1 7 mer. sense primer): 5' TACGCAAACCGCCTCTC 3' 

APCR4 ( 1 8 mer, anti-sense primer): 5" GAGTGCACCATATGCGGT 3' 



EP0953638A1 

TABLE 5: PCR primer for the construction of reshaped human F1 9 heavy 
chain variabie regions version e 

1 ■ Primer for the synthesis of version "e" 

F19IMscil^e (65 mer. anti-sense): 

5'CCT TTGGCCAG GGGCCTGTCTAACCCAGTGTATGGTGTATTCAGTGAAGGTG 
Msd 

TATCCACTAGTTTCCACTAGTTT 3' 

2. Primers hybridizing to the flanking dq1cI105 mammalian e xpression vector sequences 

HCMi (28 mer, sense): 5* GTCACCGTCCTTGACACGCGTCTCGGGA 3' 
Hucy1 (17 mer. anti-sense): 5' TTGGAGGAGGGTGCCAG 3' 



Example 5: Reshaped human F19 antibody concentrations in COS cells supematants 

[0094] COS cells were transtected with one pair of a series ol reshaped human F19 antibody constructs and Ihe 
human antibody concentration was measured using the igOI/Kappa ELISA as described in example 2. 



TABLES 



Reshaped human F19 antibody concentrations in COS 
ceil supernatants 


TVansfected Antibody compo- 
nents 


Human yl/K 


Heavy chain 


Kappa light chain 


concentration [^g/nrtl] 


Ha 


La 


2.50 


Ha 


Lb 


0.18 


Hb 


La 


1.25 


Hb 




0.10 


Hd 


La 


1.15 


Hd 


Lb 


0.18 


Ha 


La 


1.50 


Ha 


Lc 


1.56 
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lABLE 6 (continued) 



Reshaped human F19 antibody concentrations in COS 
ceil supernatants 


Transf ected Antll)ody compo- 
nents 


Hiimsin v1/lf 

nUllHIII J MIW\ 


Heavy chain 


Kappa light chain 


concentration {jig/ml] 


He 


La 


1.47 


He 


Lc 


1.97 


CF19 


U 


1.54 


CF19 


Lb 


0.07 


cF19 


Lc 


2.14 



TABLE 7 



Reshaped human F19 antibody concentrations in COS 
ceil superratants 


'n-ansfected Antibody compo- 
nents 


Human yl/K . 


lieavy chain 


Kappa light chain 


concentration [fig/mi] 


Ha 


La 


2.00 


Ha 


Lc 


2.50 


He 


La 


2.90 


He 


Lc 


3-00 


He 


La 


2.80 


He 


Lc 


3.50 



RNA splicinq events required for the expression of immunoatobulin oanes in mammalian calls 

[0095] Both mammalian expression vectors pKNIOO and pg1d105 have an intron between the variable and the con- 
40 start regions which is removed during the process of gene expression to give rise to an messenger RNA. The splicing 
evert which consists of a DN A recombination between the heavy or light chain splice donor sites and the immunoglob- 
ulin splice acceptor site is desaibed in Figure 34. 

Example 6: How cytometric analysis of the binding of cFl9 and LJ^c to FAP-expressing human ceils 

45 

[0096] The ability of LaHq to bind to both recombinart and endogenously expressed FAR on cell surface was tested. 
[0097] The example was conducted to determine the binding of LaHq to cellular FAR Both naturally FAR expressing 
MF-SH human tumour cells and FAP-transfected human tumour cell lines were used as cellular targets. LaHo was stud- 
ied in cytofluorometric assays evaluating direct tMnding to target celts as well as by the inhibitory effect on the binding 

50 of either murine F1 9 or chimeric cFI 9 arti-FAP antibodies. 

[0098] Antibodies and cell lines used were F19 (mirine monoclonal arti-human FAR antibody, IgGi subclass). mIgG 
(murine immunoglobulin, IgQ class). cF19 (chimeric monoclonal anti-human FAR antibody, IgGi subclass), LaHq 
(reshaped monoclonal anti-human FAR artibody, IgGi subclass). hIgQI (human immunoglobulin. lgG1 subclass), MF- 
SH (human malignart fibrous histiocytoma cell line), HT-1080 (human fibrosarcoma cell line), HT-1080FAR done 33 

55 (HT 1 080 cell line transf ected with cDNA encoding human FAR) 
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Direct bindino of L^ H^ to FAP on the surface of human tumour cell lines 

[0099] 5x10^ cells of the tumour cell line under investigation were incubated with the indicated concentration of test 
or control antbody in a total volume of 0.2 ml phosphate-buffered saline (PBS) supplemented with 1% bovine serum 
5 albumin (BSA) for 30 min on ice. 

[0100] Subsequently, cells were washed twice with 2 ml of PBS. resuspended in 0.2 ml of PBS supplemented with 
1% BSA. the appropriate anti-lg-antibody as secondary reagent (either a 1 :20 dilution of goat anti-mouse Ig FITC- 
lab^ed [Dianova] or a 1 :20 dilution of mouse anti-human IgG FITC-labeled [Dianova]) and incubated for another 30 min 
on ice. 

10 [0101] Cells were again washed twice with 2 ml of PBS. resuspended in a total volume of 0.5 ml of PBS supplemented 
with 1% paraformaldehyde (PFA) and kept on ice. Single cell fluorescence was determined cytoftuorometrically by ana- 
lysing the cellular green fluorescence in the 488nm light of an EPICS XL (Coulter). 

Inhf^jfnr y effect of L^H^ on Nndinp of biotinylated cF1 9 to FAP on the surface of human cell lines 

IS 

[0102] 5x10^ cells of the tumour cell line under investigation were Incubated with the indicated concentration of the 
biotin-labelled antibody in a total volume of 0.2 ml PBS supplemented with 1% BSA and the simultaneously added unla- 
belled test or control antibody for 30 min on ice. Subsequently, cells were washed twice with 2 ml of PBS. resuspended 
in 0.2 ml of PBS supplemented with 1% BSA, 1 :40 diluted streptavidin-FITC (Dianova) as secondary reagent and incu- 

20 taxed for another 30 min on ice. 

[0103] Altemativety, cells were incubated with the indicated concentrations of murine F19 and cell-bound antibody 
detected via 1 :20 diluted goat anti-mouse Ig labelled with FITC by comparable incut)ation steps. 
[0104] In each case, cells were finally washed twice with 2 ml of PBS. resuspended in a total volume of 0.5 ml PBS 
supplemented with 1% PFA and kept on ice. Single cell fluorescence was detemiined cytofluorometricaOy by analysing 

25 the cellular green fluorescence in the 488nm light of an EPICS XL (Coulter). 

[0105] Both. cFl9 and LaHq bind in a concentratton dependent manner specifically to to FAP-transfected HT- 
1080FAP clone33 human tumour cells (Table 8). No binding toFAP-negative HT-1080 cells was detectable (Table 9). 
Both cFi9 and LaHc bound in a concentration dependent manner to human f^F-SH cells endogenously expressing 
FAP (Table 10). 

30 [0106] Biotinylated cF19 in a concentration dependent manner bound to human HT-1 080FAP clone 33 (Table 1 1 ). No 
b'nding was detectable to FAP-negative HT-1080 cells (Table 12). 

[0107] Binding of biotinylated cF19 to HT-1080FAP clone 33 cells was inhibited by both unlabelled cF19 and unla- 
belledLAHc (Table 13). 

[0108] Chimeric anti-human FAP monoclonal antibody cF19 as well as reshaped human anti-human FAP monoclonal 
35 antibody LaHq (example 10) were shown to bind directly to FAP expressed on human cell lines either endogenously 
expressing this protein or transfected with cDNA encoding for rt. This binding was shown to be concentration depend- 
ent. Binding of biotinylated cF19 couW be inhibited by both unlabelled cF19 and unlabelled LaHq 
[0109] Using cytofluorometric technology, direct binding as well as inhibition of specifically binding ragents showed 
specificity of chimeric cF19 and reshaped LaHq human monock)nal antibodies to cell surface expressed FAP. 

40 



Tables 



Binding of anti-FAP antibodies to HT-1080FAP done 33 
ceils 


Concentration of anti- 
body 


Mean fluorescence Intensity 


[ng/mL] 


hlgG1 


CF19 


LaHc 


500.0 


0.12 


6.65 


2.76 


100.0 


0.12 


1.63 


0.66 


20.0 


0.12 


0.43 


0.22 


4.0 


0.12 


0.17 


0.15 


0.8 


0.12 


0.14 


0.13 
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Tables 



Binding of anti-FAP antibodies to non-transffected HT- 
lOSOcelis 


Concentration of anti- 
Ixxly 


Mean fluorescence intensity 


[ng/hfiL] 


hIgQl 


cF19 


LaHc 


500.0 


0.11 


0.11 


0.12 


100.0 


0.11 


0.11 


0.11 


20.0 


0.11 


0.11 


0.12 


4.0 


0.11 


0.11 


0.12 


0.8 


0.11 


0.11 


0.11 



20 
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Binding of anti-FAP antibodies to NIF-SH ceils 


Concentration of anti- 
body 


Mean fluorescence intensity 


[ng/mL] 


hIgQI 


cF19 


LaHc 


4.0 


0.6 


3.6 


2.8 


2.0 


n.d. 


3.3 


2.5 


1.0 


n.d. 


2.4 


1.9 


0.5 


n.d. 


1.8 


1.3 



n.d.: not done 

3S 



Table 11 



Binding of biotinyiated cFI 9 antibody to HT-1080FAP done 33 

cells 


Concentration of anti- 
body 


Mean fluorescence intensity 


[ng/mq 


Biotinyiated hlgGl 


Biotinyiated cFt9 


5.000,0 


0.2 


36.5 


1.000.0 


0.2 


18.1 


200.0 


0.2 


4,5 


40.0 


0.2 


1.3 


8.0 


0.2 


0.5 


1.6 


0.3 


0.3 
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Table 12 



Binding of biotinylated cFI 9 antibody to non-translteted HT- 
lOSOcelis 


Concentratton of anti- 
body 


Mean fluorescence intensity 


[ng/ml] 


Biotinytated htgG1 


Biotinylated cF19 


5.000.0 


0.1 


0.1 


1,000.0 


0.1 


0.1 


200.0 


0.1 


0.1 


40.0 


0.1 


0.1 


8.0 


0.1 


0.1 


1.6 


0.1 


0.1 



Table 13 



Competition of anti-FAP antibodies witii tlie binding of biotinylated 
cF19 to HT-1080FAP done 33 celis 




Concentration of com- 
petitor antibody 


Mean fluorescence con- 
centration 


Competitor antibody 


[^g/mL] 




no 


0.00 


11.2 


hIgQI 


1.00 


9.0 


hIgGI 


3.16 


11-3 


hIgGI 


10.00 


9.8 


hIgGI 


31.66 


10.3 


cF19 


1.00 


7.5 


cF19 


3.16 


4.8 


cF19 


10.00 


1.3 


cF19 


31.66 


1.2 


LaHc 


1.00 


8.0 


LaHc 


3.16 


5.5 


LaHc 


10.00 


2.9 


LaHc 


31.66 


1.7 


Biotinylated cF19 was used at a concentration of 1 ^g/mL in alt tests shown 
in the table. 



Example 7: in vitro immune effector functions of monoclonal antibody LaHc 

[01101 This experiment was conducted to detemtine the potential of the monoclonal antibody (mab) LaHq with spe- 
cificity tor f ibroblast activation antigen (FAR) to lyse FAP-expressIng targets in the presence of human complement or 
human mononuclear leukocytes, respectively. 
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[0111] In particular, the ability of L/^Hq to mediate cytotoxic effects against HT-10dOFAP clone 33 cells, which 
expressed human FAP on the surface, was studied. Cytotoxicity was determined in vitro using the following approach: 
^^Cr-labelled target cells were incubated in the presence of LaHq with human serum as source of complement or 
human MNC (peripheral blood mononuclear cells) as effectors. Release of ^^Cr war measured as measure of target- 
5 cell lysis. 

[0112] Antaxxlies and cell lines used were L^Hc (reshaped human anti-human FAP IgGI antibody), hlgGl (human 
IgQI isotype control), 3S193 (murine monoclonal anti-Lewis^ lgG3 antibody). mIgG (murine IgG control). HT-10dO 
(human fibrosarcoma). HT-1080FAP clone 33. (HT1080 transfected with cDNA encoding human FAP). MCF-7 (human 
breast adenocarcinoma cell tine). 

10 

Complement-mediated lysis of target cells by L^H^ 

[0113] TunfK>ur cells were radiolabelled by incubation in RPMI1640 medium with 100 ii\ ^^Cr (NEN) at 37"* C for one 
hour. Subsequently, cells were washed twice in ^^Cr-free medium and resuspended at a concentration of 2x10^ cells 
IS per ml. 

[01 14] Human serum as source of complement was freshly prepared from blood of different volunteers. Blood was 
taken by puncturing the arm vein, remained at room temperature for one hour to allow clotting to occur, and was kept 
at 4*' C over night. Seaim was seperated by centrtfugation and taken off from the sediment. 
[01 1 5] The antibody under study was diluted from the stock solution to the appropriate concentration in RPMi1640 
20 cell culture medium. 

[0116] 1x10"^ radiolabelled tumour cells of the indicated cell line were incubated in the presence of different concen- 
trations of test or control antibody and 25% of the human serum used as source of complement for 2 h at 37*" C in a 
95% air and 5% CO2 incubator. Incubation was performed in U-shaped 96-well plates in a total volume of 200 ^1 
RPMI1640 and done in triplicate. After the incubation period, plates were centrifugated. 100 fil of the supernatant were 
25 taken off and radioactivity was determined in a gamma-counter. Total number of incorporated radioactivity was deter- 
mined by measuring 1 0^ target cells. Spontaneous release was defined as activity released from the target cells in the 
absence of both antibody and complement during the described incubation period. 
[01 1 7] Specific lysis was calculated as follows: 

30 

[activity sample] - [activity spontaneous release] 

specificlysis (in %) = x100 

[maximum activity] - [activity spontaneous release] 



40 Antibody-dependent cellular cytrrtoxicity (ADCC) of L^H^ 

[0118] Tumour cells were radiolabelled by incubation in RPMI1640 mecfium with 100 \i\ ^^Cr at 37*'C for one hour. 
Subsequently, cells were washed twice in ^^Cr-free medium and resuspended at a concentration of 2x10^ cells per mL. 
[0119] MNC (peripheral Wood mononuclear cells) were prepared from peripheral blood taken by puncturing the arm 

45 vein of different healthy human volunteers. Clotting was prevented by the addition of 20% citrate buffer. MNC from 4 mL 
of this blood preparation were purified by centrifugation (30 min at 400 G and room temperature) on 3 mL of lymphocyte 
preparation medium (Boehringer Mannheim. Germany). MNC (peripheral blood mononuclear ceils) were taken off from 
the gradient, washed three times and diluted with RPM11640 to the appropriate concentration. Lymphocyte activated 
killer (LAK) cells were derived from MNC (peripheral Wood mononuclear cells) by incubation for 5 days at 37° C In an 

so 95% air and 5% CO2 incubator at an initial density of 1 .3x1 0^ celts per mL in the presence of 1 0OU recombinant human 
lnterleukln-2 (iL-2). The antibody under study was diluted from the stock solution to the appropriate concentration in 
RPMI1640 cell culture medium. 

[0120] 1x10* radiolabelled tumour cells of the indicated cell line were incubated for 5 h at 37°C and 5%C02 in the 
presence of different concentrations of test or control antibody and MNC (peripheral Wood mononuclear cells) in a 
55 number necessary to reach the indicated effector.larget cell ratio. Incubation was performed in U-shaped 96-weil plates 
In a total volume of 200 ^1 RPMI1640 and done in dupllcata 

[0121 ] After the incubation period, plates were centrifugated, 1 00 mI of the supernatant were taken off and radioactiv- 
ity was determined in a gamma-counter. Total number of incorporated radioactivity was determined by measuring 10"^ 
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target cells. Spontaneous release was defined as activity released from the target cells in the absence of both antibody 
and effector cells during the described incubation period. 
[01 22] Specific lysis was calculated as follows: 



10 



[activity sample] - [activity spontaneous release] 

specific lysis (in %)= — — — ~ x 1 00 

[maximum activity] - [activity spontaneous release] 



IS Antibody mediated compiem ent Ivsis of tumour cells 

[0123] No complement mediated lysis above control was seen in HT-1080FAP clone 33 cells with UHc up to a con- 
centration of 50 ng/mL (Table 14, Table 15a) 

[01 24] Lytic activity of human serum used as source of compiement was shown by lysis of MCF-7 human breast car- 
20 dnoma cells In the presence of 12.5 ng/mL 3S193. a murine monoclonal anti-Lewis^ antibody with known complement 
activating ability (Table 15b) 

Antibody mediated cellular Ivsis of tumour cells 

[01251 In the presence of LaHq in a concentration of up to 10 jig/mL. no lysis of HT-1080FAP clone 33 above Isotype 
control was detectable in ADCC mediated by human MNC (peripheral blood mononuclear cells, Table 16) or human 
LAK cells (lymphokine activated killer cell) (Table 17) at an effector rtarget ratio of 50:1 : 

[0126] In appropriate in vitro assays with either human complement or with human MNC (peripheral bkxxl mononu- 
clear cells) as effector mechanisms, human anti-FAP monoclonal antibody LaHc revealed no relevant cytotoxic effect 
above controls on FAP expressing tumor cell line HT-1080FAP clone 33. 

[0127] In vitra LaHq is unable to mediate cytotoxicity effected by human conrplement or human MNC (peripheral 
blood mononuclear ceUs) on a cell line positive for FAR tiie antigen recognized by this antibody. 
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Table 14 



Specific complement lysis (in %) of HT-1080FAP clone 33 
tumor ceil targets mediated by LaHq 


Source of human serum: 


HT-1080 clone 33: 


concentration of anti- 
body 


hIgGI isotype control 


LaHc 


A50^g/mL 


5 


4 


A10ug/rnL 


5 


3 


B 50 ^g/mL 


7 


5 


BIO^g/mL 


6 


5 


0 ng/mL 


0 


0 


Incubation: 2 hours at 37°C, 25% serum from human volunteers A 

or B, respectively, as source of complement. 
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Table 15a 



10 



15 



20 



2S 
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40 



45 



50 



55 



Specific complement lysis (in %) of HT-1080FAP clone 33 
tumor cell targets mediated by human anti-FAP mono- 
clonal antibody L^Hc 


Sourcfi of huiTian fiarum* 


HTIOSOdone 33: 


^A/l iwvl lU dUWI 1 CU 1(1 

body 


hln^^l 




A10.00Mg/ml 


2 


1 


A^50^g/mi 


2 


2 


A 0.60 jig/m( 


1 


1 


AO-IS^g/ml 


1 


2 


A 0.00 ^ig/ml 


2 


2 


B 10.00 ^ig/ml 


2 


2 


B 2.50 \iQlm\ 


2 


2 


B 0.60 (xg/ml 


2 


2 


B0.15^g/ml 


2 


2 


B 0.00 \iQ/m\ 


2 


2 


C 10.00 MQ/ml 


2 


2 


C 2.50 iLQ/vnH 


1 


1 


C 0.60 ^g/ml 


1 


1 


C 0.15 fig/ml 


2 


1 


C 0.00 ^g/ml 


3 


3 


Incubation: 2 hours at 37°C, 25% serum from human volun- 
teers A, B or C, respectively, as source of complement. 


Table 15b 



Specific compiement lysis (in %) of IVICF-7 tumour cell 
targets mediated by murine anti-Lewisy monoclonal 
antibody 3S193 


Source of human serum: 


MCF-7: 


concentration of anti- 
body 


mIgG 


3S193 


A 10.00 jig/ml 


0 


21 


A 2.50 MQ/ml 


1 


21 


A 0.60 ^g/ml 


0 


21 


AO.ISpg/ml 


1 


18 


A 0.00 fig/ml 


0 


0 


B 10.00 ^ig/ml 


1 


13 


B2.S0^g/ml 


0 


17 
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Table 15b (continued) 



Specific complement lysis (In %) of MCF-7 tumour cell 
targets mediated by murine anti-Lewis^ monoclonal 
antit>ody 381 d3 


Source of human serum: 


MCF-7: 


concentration of anti* 
bodiy 


mIgG 


3S193 


B 0.60 ^g/ml 


1 


18 


B0.15^g/ml 


1 


15 


B 0.00 ng/ml 


0 


0 


C10.00Mg/ml 


1 


22 


C 2.50 jig/ml 


0 


23 


C 0.60 ng/ml 


1 


26 


C0.15 ng/ml 


1 


20 


C 0.00 ng/ml 


1 


1 


Incubation: 2 hours at 37" C, 25% serum from human volun- 
teers A, B or C. as source of complement. 



Table 16 



ADCC (antibody-dependant cellular cytotoxicity) (specific lysis in 
%) of HT-1080FAP done 33 target cells by human MNC (peripheral 
blood mononuclear cells) mediated by LaH^. 


HT-IOSOFAPdone 33: 


Concentration of anti- 
body: 


HT-1080FAP clone 33: 


pn ng/mLJ 


hIgGI 


LaHc 


10.000 


2 


2 


2.500 


2 


2 


0.625 


2 


2 


0.156 


3 


3 


0.000 


3 


3 


Incubation: 5 hours at 37''C, 10"^ target cells and an effector:target cell 
ration of 50:1. 
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Table 17 



ADCC (antibody-dependenat cellular cytotoxicity, sproif ic lysis In 
%) of HT-1080FAP clone 33 target cells by LAK cells (lymphokine 
activated idller cells) mediated by LaHc. 


Concentration of anti- 
body: 


HT-1080FAP clone 33: 


[in ng/mL] 


hIgGi 


LaHc 


10.000 


12 


14 


2.500 


14 


17 


0.625 


14 


21 


0.156 


15 


21 


0.000 


14 


14 


Incubation: 5 hours at 37''C, 10^ target cells and an effectorlarget cell 
ration of 50:1. 



Example 8: Immunohistochemical analysis of monoclonal antibody LaHc binding to normal and neoplastic 
liuman tissues 

[01 28] This experiment was performed to determine the binding characteristics of the humanized mAb LaHq to nor- 
mal and neoplastic human tissues. 

[0129] The following antibodies were used: LaHq. cF19, and the negative control hu IgGi were directly biotinylated 
according to methods of the state of the art and used at concentrations of 2.5 to 0.25 mg/ ml in 2% BSA/PBS (bovine 
serum albumin in phosphate-buffered saline). Murine mAb F19 was used as tissue culture supernatant of the F19 hybri- 
doma, at dilutions of 1 :5 to 1 :1 0 in 2% BSA/PBS. 

[01 30] The following reagents were used for immunochemical assays: Streptavidin peroxidase complex (Vector Labs. , 
Burlingame. CA, USA), Avidin-biotin peroxidase complex (Vector Labs ), Biotinylated horse anti-mouse (Vector Labs.). 
DAB (diaminobenzidine, Sigma Chemical Co. St. Loiis, MO. USA). Harrris' hematoxylin. 

[01 31 ] Fresh frozen tissue samples examined included the following: Normal colon, breast, lung, stomach, pancreas, 
skin, larynx, urinary bladder, smooth and skeletal muscle. 

[0132] Among the tumors tested were carcinomas from breast, colon, lung, esophagus, uterus, ovary, pancreas, 
stomach, and head and neck. 

[0133] An indirect immunoperoxidase method was carried out according to state of the art methods (Garin-Chesa P, 
Old LJ. Rettig WJ: Cell surface glycoprotein of reactive stromal ftoroblasts as a potential antibody target in human epi- 
thelial cancers. Proc Natl Acd Sci USA 1990: 87:7235-7239) on five micrometer thickness fresh frozen sections. 
[0134] DAB was used as a substrate for the final reaction product. The sections were counterstained with Harris' 
hematoxylin and examined for antigen expressnn. 

LAHr^expression in nornral human tissues 

[0135] The normal tissues tested were negative for LaHq expression, except for the normal panaeas in which a sub- 
set of positive endocrine cells In the islets of Langerhans (A cells) were identified with LaHq. cF19 and F19. (Table 18). 
No immunoreactivily was observed with the hu IgGi (human immunoglobulin IgGi sutxiass) used as a negative con- 
trol. 

L^Hn expression in tumors 

[0136] In the tumor samples. LaHc. cF19 and F19 showed an indistinguishable pattern of expression in the tumor 
stromal fibroblasts. A strong and homogeneous expression was found in the majority of the cases examined, especially 
55 In the cancer samples derived from breast, colon, lung, pancreas and in the squamous ceD carcinomas (SQCC) of the 
head and neck tested (Table 19). No immunoreactivily was obsen^ed with the hu IgGi used as negative control. 
[0137] LaHc, cF19 and F19 showed immunoreactivity with the tumor stromal fibrotrfasts in the epithelial cancer sam- 
ples tested. No LaHq or F19 immuno-reactivity was seen with either the fibrocytes of the normal organ mesenchyme or 
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the parenchymal cells of normal aduH organs. The only exception was a subset of endoaine cells in the panaeatic 
islets, presumably glucagon-producing A cells, which react with the anti-FAP antitjodies. 

[0138] Immunohistochemical analysis of L^Hc in normal human tissues and FAP-expressing hunfian carcinomas 
showed indistinguishable patterns of binding for L^Ho cF19 and murine mAb F19. 



Table 18 







Immunoreactivity of mAbs Lpfi^ cF19 and F19 with normal 


human tissues 


10 


Tissue type 




Criy 


F1Q 

r 1*7 




Breast 




-Duct epithelium 
















-Myoepithelial cells 










Colon 




-Glandular epithelium 








IS 








-Smooth musde 










Lung 




-Bronchial epithelium 
















•Ah/eolar epithelium 








20 




Stomach 


-Glandular epithelium 
















-Smooth muscle 












Urinary bladder 


-Urothelium 


















-Smooth muscle 








25 




Pancreas 


-Exoaine acini 
















-Endocrine islet cells 


+ subset only 


-i-subset only 


•1- subset only 




Larynx -Squamous epithelium 








30 


Lymph node -Lymphocytes 










Skeletal muscle- 










Connective tissue 










Skin 






-KoBtinocytes 








35 








-Sweat glands 









40 

Table 19 



Immunoreactivity off mAbs LaHc, cF19 and F19 with human tumor samples 


Tumor type 


No. 


LaHc 


cF19 


F19 


Breast cancers (infiltrating ductal type) 


7 


7 Positive 


7 Positive 


7 Positive 


Colon cancers (adenocarcinomas) 


7 


7 Positive 


7 Positive 


7 Positive 


Lung carcinomas (adenocarcinoma (2) 
large cell type (2) squamous type (4) 


8 


7 Positive 
1 Negative 


7 Positive 
1 Negative 


7 Positive 
1 Negative 


Esophageal cancers (squamous type) 


1 


1 Positive 


1 Positive 


1 Positive 


Endometrial cancers (adenocarcinoma) 


1 


1 Negative 


1 Negative 


1 Negative 


Gastric cancers (adenocarcinoma) 


2 


2 Negative 


2 Negative 


2 Negative 


(Dvarian cancers (serous denocarcinoma) 


2 


1 Positive 
1 Negative 


1 Positive 
1 Negative 


1 Positive 
1 Negative 



30 




EP0 953 639A1 

Table 19 (continued) 



Immunoreactivity of mAbs L^Hc, CF19 and F19 with human tumor samples 


Tumor type 


No. 




cF19 


F19 


Pancreatic cancers (adenocarcinomas) 


2 


2 Positive 


2 Positive 


2 Positive 


Head and neck cancers (squamous ceil 
type) 


4 


4 Positive 


4 Positive 


4 Positive 


Abbreviations: No, number of cases from different patients studied; positive, number of cases showing antigen 
expression in the tumor stroma: negative, number of casestested that lacked detectable antigen expression. 



Example 9: Species specificity of LaHq binding in tissue sections 



IS [0139] This experiment was conducted to assess the reactivity of LaHc with tissues from mouse, rat. rabbit and 
cynomolgus monkeys by immunohistochemical methods. 

[0140] Also used in these tests were cF19 and hulgGi as negative controls. The reagents used for Immunohisto- 
chemistry were Streptavidin peroxidase complex (Vector Labs., Buriingame. CA. USA). DAB (Sigma Chemical Ck).. St. 
Louis, MO, USA) and Harris' hematoxylin. 
20 [0141] The following fresh frozen tissue samples from mouse, rat. rabbit and cynomolgus were tested: Brain, liver, 
lung, kidney, stomach, pancreas, intestine, thymus, skin, muscle, heart, spleen, ovary, uterus and testes. As positive 
control, sections from normal human pancreas and a breast carcinoma sample were includded in every assay. 

Immunohistochemistry 

25 

[0142] An indirect immunoperoxidase method was carried out as described in the state of the art (Qarin-Chesa P. Old 
U. Rettig WJ: Cell surface glycoprotein of reactive stromal fibroblasts as a potential antibody target In human epithelial 
cancers. Proc Natl Acad Sci USA 1990; 87:7235-7239) on five miaometer thickness fresh frozen sections. The anti- 
bodies LaHc, cFI 9 and hu IgGi (at 1 jig/ml) were biotinylated according to the state of the art and were detected with 
30 streptavidin peroxidase complex. DAB was used as a sut)strate for the final reaction product. The sections were coun- 
terstained with Harris' hematoxylin and examined for antigen expression. 

[0143] The normal tissues tested did not react with either LaHq or cF1 9 in the experiments (Table 1 ). 
[0144] The normal human pancreas used as positive control showed L^Hq and cFI 9 binding in a subset of endocrine 
cells in the islets of Langerhans as previously described for F19. In addition, binding of LaHq and cF19 was seen in the 
35 tunror stromal fibroblasts In the breast carcinoma sample. 

[0145] Immunohistochemical analysis of normal tissues from mouse, rat rabbit and cynomolgus failed to detect any 
binding of either L^Hc or cF19. in the experiments performed. 
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Table 20 



Binding of LaHq to tissue sections of non-liuman species, as detennined by immunohlsto- 

chemlstry. 


Organ/ Tissue typ 


Mouse 


Rat 


Rabbft 


Cynomolgus 


Brain 


-Cerebral cortex 

-Cerebellum 










Liver 


-Hepatocytes 
-Portal triad 






- 


- 


Lung 


-Bronchi 
-Alveoli 






- 


- 


Kidney 


-Glomeruli 
-Tubular epithelium 






- 


- 


Stonnach 


-Glandular epithelium 
-Smooth musde 










Panaeas 


-Exocrine acini 
-Endocrine islets 










Intestine 


-Glandular epithelium 
-Smooth musde 






- 


- 


Thynius -Lymphocytes 










Skin 


-Keratinocytes 
-Sweat glands 
•Hair follicles 






- 


- 


Skeletal musde 










Heart 










Spleen 'Lymphocytes 










Ovary 


-Follicular epithelium 
-Stroma 










Uterus 


-Myometrium 
-Cervix uteri 










Testis -Tubular epithelium 


nt 


nt 


nt 




Connective tissue 











nt, not tested 



50 Example 10: Construction of ceil iines producing chimeric and reshaped anti-FAP monocional antibodies 

[0146] The objective of this experiment was to demonstrate stable cell lines according to the invention expressing 
LaHc. La Ha, LbHb, LbHd. and cF19 in CHO DG44 cells. Stable cell lines transfected with humanized or chimeric F19 
antibodies were produced and their identity was confirmed by PCR anplification o1 heavy and light variable regions 
55 using genomic DANN derived from each transfectant as template. 

[0147] CHO DG44 cells maintained under serum-free conditions in SFM-II medium. Lipofectin and SFM-I! serum-free 
medium were otrtained from Gibco/BRL. Genetidn and all restriction enzymes were obtained from Boehringer Man- 
nheim. Pfu polymerase was obtained from Stratagene. 



32 




EP0953 639A1 



[0148] DNA for transfections was purified from E. coli cells using QiaFilter Maxi Cartridges (Qiagen) as directed by 
the manufacturer. All DNA preparations were examined by restriction enzyme digestion. Sequences of I^Hq variable 
regions in their respective vectors were confirmed using an ABI PRISM 310 Sequencer. 

[0149] Further information regarding the vectors and DNA sequences employed is available in the prior examples. 

5 

Transfection of CHO DG44 cells 

[0150] Cells in logarithmic growth were plated into 6 well plates containing 1 mL fresh SFM-II medium. Plasmids 
encoding heavy and light chains of humanized or chimeric F19 verions were cotransfected into CHO DQ44 cells using 

10 liposomal transfection. Liposomes were prepared using 6 ^l Lipofectin reagent and 0.5 ^g of each vector (one for the 
desired heavy chain and one for the light) as described for UpofectAMINE transfections except that SFM-II medium was 
used to dilute ail reagents. Twenty-four hours later, cells were diluted 1 :10 into SFM-II medium containing 300 ng/mL 
Geneticin. After the initial phase of cell killing was over (10-14 days), the concentration of Geneticin was reduced to 200 
mg/hiL and methotrexate was added to a final concentration of 5 nM. Methotrexate concentrations were inaeased after 

75 1 0-1 4 days to a final concentration of 20 nM. 

PGR Amplification of tr ansfectant DNA 

[01 51 ] 1 0^ CHO DG44 cells were centrifuged in an Eppendorf miaocentrifuge briefly at full speed, washed once with 
20 PBS, and pelleted once again. Genomic DNA was prepared by ethand precipitation after SDS lysis and Proteinase K 
treatment of the cell pellets. 

[01 52] A mixture containing one of the following primer pairs. dNTPs, buffer, and Pf u polymerase was used to amplify 
either the heavy or light chain variable region using genomic DNA as template. The resulting PCR products were 
digested with the appropriate restriction enzyme and analyzed by agarose gel electrophoresis to confirm their identity. 

25 

Light chain primer set: 
[P153] 

30 5-GAG ACA TTG TQA CCC AAT CTC C ■ 3' PKN 1 690 

5'- GAC AGT CAT AAA CTG CCA CAT CTT C - 3* PKN. 1 930. R 
Heavy chain primer set: 

36 

10154] 

5'-TTQ ACA CGC GTC TCG GQA AGC TT - 3* PG 5863 

40 S- GGC GCA GAG GAT CCA CTC ACC T - 3' PG 6332. R 

[0155] The undigested heavy chain PCR product has a predicted size of 469 bp while the light chain PCR product 
has a predicted size of 286 bp. Verification of identity was determined by restriction enzyme digest with BstEII (heavy 
chain) or NIalV (light chain). 

45 [0156] CHO cell lines were transfected with LaHq, La Ha. LeHg. LbH[> as well as cFI 9. Genetidn-resistant cells were 
obtained and these cells were further selected for resistance to methotrexate. PCR amplif icatbn of the light and heavy 
chain DNA produced the expected bands and confirmed the identity of LaHq, LaHa and L^Hq transfectants. The I^Hq 
full length heavy chain PCR product was subcloned and resequenced in its entirety. 

[0157] The cells described were maintained under serum-free conditions at all times and were not treated with ani- 
50 mal-derived products such as trypsin. 

[01 58] Producer cell lines transfected with expressing monoclonal LaHc. LaHa, LbHb, LbHq and cFI 9 antibodies were 
produced. Their identities were confirmed using PCR amplification of both their heavy and light chain variable regions. 
The DNA sequence of the heavy chain variable region PGR products for LAHc-transfected cells was confirmed. 

55 Example 1 1 lExpression of antibody proteins in Chinese hamster ovary DG 44 cells and their purif icalion 

[0159] The objective of this experiment was to express and purif iy of LaHq, LaHa, L^Hq, and LbHq mAbs to enable 
their characterization. Other goals included the establishment of a quantitative ELISA to permit measurement of anti- 
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body concentrations in both crude media samples as well as purified Ig samples and detemiination of relative expres- 
sion levels of various humanized F19 constructs using this assay. 

[0160] Serum-free CHO DQ44 cells and USP-grade methotrexate were obtained from the Biotechnical Production 
Unit of the Dr. Karl Thomae GmbH. Biberach. Germany; both products are also commercially available. Cells were 

5 maintained under serum-free conditions at all times. SFM-II serum-free medium was obtained from Gibco/BRL 

[01611 Protein A agarose was from Pierce Chemical (Indianapolis, IN, USA). Human lgG1 standards (Cat. No. I 
3889) p-Nitrophenyl phosphate tablets (N 2640). bovine serum albumin (BSA) (A 7906). and goat anti-human kappa 
chain specific alkaline phosphatase-conjugated antibody (A 381 3) were obtained from Sigma Chemical (St Louis. MO. 
USA) Goat anti-human gamma-chain specific alkaline phosphatase-conjugated antibody was obtained from Jackson 

10 Immunoresearch Labaatories (through Stratech Scientific). Tris-buffered saline (TBS) consisted of 150 mM NaCI, 50 
ml^ Tris. pH 7.5. 

Q ^w rutturft conditions for antibody expression 

IS [01621 Cells were cultured and LAHc-producing cells were maintained in T-1 75 flasks In SFM-II serum-free medium 
without agitation. The medium contained 200 ugMiL Geneticin and 20 nM methotrexate without antibiotics. Cells were 
passaged by dilution, were not adherent and grew in small clusters. When the cells reached stationary phase, the 
medium was collected and centrifuged to remove cells and frozen at ■2a*C until needed. 



20 Purification of L^Hn 

[0163] All purification steps were earned out at 4» C. A CIO/10 column (Pharmacia Fine Chemicals) was packed with 
Protein A agarose (3 mL bed volume). The column was washed with TBS and preeluted once with 0.1 M Na citrate. pH 
3 0 to insure that no loosely bound material remained on the column. The column was then immediately reequilibrated 

25 with TBS and stored at 4«C. Spent culture supernatants were thawed and centrifuged at 1 0.000 xg for 30 minutes pnor 
to Protein A chromatography to remove debris and diluted with an equal volume of TBS. This material was loaded onto 
the Protein A column at 0.5 mUmin using a P-1 peristaltic pump (Pharmacia) and washed with TBS until the absorb- 
ance at 280 nm was undetectable. Elution of the anibody was initiated with 0. 1 M Na citrate pH 3.0 at approximately 0.2 
mLVirtn. The elution was monitored at 280 nm and one mL fractions of the eluted material were collected into tubes con- 

30 taining sufficient Tris base pH 9 to neutralize the citrate buffer. Protein-containing fractions were pooled and concen- 
trated using an Amicon filtration apparatus with a YM-30 filter and dialyzed against PBS. The column was immediately 
regenerated with TBS. Protein dye-binding assays were performed with the BioRad (Hercules. California) protein deter- 
mination kit. according to the manufacturer's instructions, using bovine serum albumin as a standard. 



3S Human I9G fg amma immunoglobulin) ELISA 

[01641 ELISA plates were coated overnight with 100 \iL of goat anti-human gamma-chain specific alkaline phos- 
phatase-conjugated antibody at 0.4 mg/mL in coating buffer at 4«a Coating antibody was removed and plates were 
blocked with 2% BSA in PBS for 2 hours. All subsequent steps were performed at 37*»C. Blocking buffer was replaced 

40 with antibody samples or human IgGI standard diluted in dilution buffer, s^ially diluted in a 200mL volume, and incu- 
bated for one hour. Negative controls included dilution buffer and/or culture medium of nontransfected celts. Wells were 
washed and 100 uL of goat anti-human kappa chain specific alkaline phosphatase-corijugated antibody diluted 1:5000 
was added and incubated for one hour. Wells were washed and 1 00 |iL reaction buffer was added and incubated for 30 
minutes. The reaction was stopped by addition of 1 M NaOH and absorbance read at 405 nm in an EUSA plate reader. 

45 Results were analyzed by four-parameter iterative cun/e fitting. 

[01651 Amino acid analysis was performed according to methods available in the state of the art. 
[01661 Monoclonal antibody LaHq was produced and purified to homogeneity using Protein A affinity chromatography. 
ELISA assays using human IgGI as standard indicated LaHc recoveries exceeding 70%. The purity of the matenal was 
estimated to be >90% by SDS-polyacrylamide gel electrophoresis. Representative expression data and typical puriffica- 

50 tion yields are shown in Table 21 . 
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Table 21 



Expression data and purification yieids FAP antibody proteins in CHO ceils 


Antibody 


Expression levels in 
crude media samples 
(ELISA) 


Purified antibody yields 


Yield improvement [puri- 
fied antibody] 




7- 10mg/L 


- 5 - 7 mg/L 


500-700 




5 ■ 7 mg/mL 


-3-4 mg/L 


300-400 


HbLb 


0,5 - 1 mg/mL 


- 0.2 - 0.5 mg/L 


20-50 




0.8 - 1 .5 mg/mL 


- 0.3 - 0.8 mg/L 


30-60 


Chimeric F19 


- 0.02 mg/hiL 


< 0.01 mg/L 


1 


Representative expression data for each of tlie anti-FAP antibodies produced in this study are shown. 
Recoveries after Protein A agarose affinity chromatography were based on protein dye-binding measure- 
ments of the purified Ig using BSA as a standard. 



20 

Example 12: Binding of monoclonal antibody L^Hc to isolated recombinant human FAP 
[D167] The objective of this study was to characterize binding of LaHc to isolated recombinant human FAR 
25 CD8-FAP ELISA 

[0168] ELISA plates were coated overnight with 100 ^Lof mouse anti-rat antibody (Sigma Chemical R0761) at 1:2000 
in coating buffer at 4 *»C. Coating antibody was removed and plates were blocked with 2% BSA In PBS for one hour. /Ml 
subsequent steps were performed at room temperature. Blocking buffer was replaced with 100 mL of 1 ng/mL rat anti- 

30 CDS antibody (Pharmingen 01 041 D) and incubated for one hour. Plates were washed and 100 ^L CD8-FAP culture 
1^ supernatant (1 :2 in PBS) was added and allowed to bind for one hour. Plates were washed and antibody samples were 
added (two-fold serial dilutions) in a 100 ^L volume and incubated for one hour. Negative controls included human IgG 
and/or culture medium of nontransfected cells. Wells were washed and 100 ^1 of horse radish peroxidase (HRP) coriju- 
gated mouse anti-human IgGI anttoody (Zymed 05-3320) diluted 1 :500 in dilution buffer were added and incubated for 

35 one hour. Wells were washed and 100 ^L HRP substrate, (aztno-bis (3-ethyfbenzthiazotine S-sulfonic) add. Sigma 
Chemical A9941), were added and incubated for 60 minutes. The reaction was stopped addition of 1 M NaOH and 
absorbance read at 405/490 nm in an ELISA plate reader. Results were analyzed by four parameter curve iterative 
curve fitting. 

[0169] Alternatively, plates were coated directly with cFl9. FAP (recombinant hunran FAP) was allowed to bind to 
40 these plates as above and biotinylated LaHq (-1 ^g/mL) was then added, /^body binding was detected with HRP- 
streptavidin conjugate as akx>ve. 

Solubilization of membrane-bound human FAP 

45 [0170] FAP-expressing 293FAP 1/2 cells or control 293 cells were washed with PBS and lysed with 1% Triton X-1 14 
in Tris-buffered saline. Nuclei and debris were removed by centrifugation at 10.000 xg. The supernatant was phaseisar- 
titioned (Estreicher A. Wohlend A. Beiin D. Scheuning WD Vasalli JD. Characterization of the cellular binding site for the 
urokinase-type plasminogen activator. J Biol Chem 1989; 264:1 180-1 189) to enrich membrane proteins. The detergent 
phase was collected and diluted In buffer containing 1% Empigen BB (Cattxochem) to prevent reaggregation of the Tri- 

50 ton X-1 14. 

[0171] This material was subjected to Concanavalin A agarose chromatography (Rettig WJ. Garin-Chesa P. Healey 
JH. Su SU Ozer HL, Schwab, M, Albino AP. Old LJ. Regulation and heteromeric structure of the fibroblast activation 
protein in namal and transformed cells of mesenchymal and neuroectodermal origin. Cancer Res 1993; 53:3327- 
3335). 

55 

Biotinvlationof L^He 

[0172] LaHc (1 -2 mg) was dialyzed against SOmVI bicarbonate buffer and biotinylated with a ten-fbkJ molar excess of 
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suHbsuccinimidyl-6-biotinannido hexanoate (NHS-LC biolin. Pierce Chemical. Rockford. Illinois, USA) for 2 hours at 
room temperature. Unreacted product was removed by repeated microdialysis In a microconcentrator. 

Transient transfections 

[0173] COS-7 cells (American Type Tissue Culture Collection, reference number GRL 1651) were cotransfected by 
electroporation with the heavy and light chain vectors encoding LaHq. 

[0174] Antj-CD8 monoclonal antibody was immobilized onto microliter plates. CD8-FAP from medium of insect cells 
infected with CD8-FAP baculovirus was allowed to bind to these plates. Spent medium from COS-7 cell cultures tran- 
siently transfected with two separate vectors encoding L^Hc was serially diluted and added to the wells containing the 
inwrobilized CD8-FAR LaHc bound to isolated immobilized CD8-FAP protein (Rgure 35). Culture supernatants from 
mock-transfected COS-7 ceils failed to demonstrate binding. 

[0175] Recombinant membrane-bound FAPfromdetergentextract5of293FAP 1/2 cells or control extracts was serially 

diluted and Immobilized via chimeric F19 monoclonal antibody bound to miaotiter plates. Biotinylated LaHc bound 

recombinant human FAR immobilized with cF19 (Rgure 36) in a concentration-dependent manner. 

[01 7S] LaHc recognized isolated immobilized recombinant human FAP can-ying the epitope for murine F19. LaHc 

bound to both CD8-FAP produced in insect cells, as well as FAP protein produced in 293FAP 1/2 cells. 

[0177] Culture supernatants from C0S7 cells transfected with either heavy and light chain vectors encoding LaHc or 

without DNA (Control) were collectaJ three days posttransfection. CD8-FAP was immobilized via an anti-CD8 antibody 

as described in the text. Serial dilutions of the C0S7 supernatants were allowed to bind to the immobilized CD8-FAP 

and subsequently detected with an HRP-conjugated anti-human lgG1 antibody 

[0178] Detergent extracts of FAP-expressing 293FAP 1/2 cells or control 293 cells were serially diluted and added to 
cF19-ooated microtiter plates. Biotinylated LaHq was added and binding of biotinylated LaHq was detected with HRP- 
conjugated streptavidin. 

Example 13: Characterization of HT-1 080 fibrosarcoma cells and 293 human embryonic iddney ceils trans- 
f octets tsflth cDNA fdr human FAP 

[0179] Fibroblast activation protein (FAP) is a cell-surface, membrane-bound protein which carries the F19 epitope 
and is expressed on tumor stromal fibroblasts. Cell lines expressing recombinant FAP protein and matched controls 
lacking FAP were generated for the characterization of anti-FAP monoclonal antibodies. 

[0180] Cells used were HT-1080 cells (reference mvrber CCL 121) and 293 human embryonic kidney cells (reference 
numb®' CRL 1573) were obtained from the /American Type Culture Collection (Maryland. USA). Transfedam was 
obtained from Promega. Geneticin and all restriction enzymes were obtained from Boehringer Mannheim. DNA for 
transfections was purified from E. coli ceUs using QiaFilter Maxi Cartridges (Qiagen) as directed by the manufacturer. 
All DNA preparations were examined by restrictfon enzyme digestion. Vector sequences were conf inned i^ng an ABI 
PRISM 310 Sequencer. 

[01 81 ] Further information regarding the vectors and DNA sequences employed has beei described in Scanlan MJ. 
Raj BK. Calvo B. Garin-Che^ P. Sanz-Moncasi MP. Healey JH. Od U. Rettig WJ. Molecular cloning of fibroblast acti- 
vation protein alpha, a member of the serine protease family selectively expressed in stromal fibroblasts of epithelial 
cancers. Proc Natl Acad Sci USA 1992: 89:10832-10836. The FAP cDNA sequence has been deposited in Genbank 
(accession nunt>er HS09287). 

Cell culture and immunoassays 

[0182] HT-1080 cells were transfected with 1 mg DNA using Transfectam according to the maufacturer's instructions. 
Human en*ryonic kidney 293 cells were transfected by calcium phosphate transfection (Brann MR; Buckley NJ; Jones 
SVP; Bonner Tl. 

[0183] Expression of cloned muscarinic receptor in A9 L cells. Md Pharmacol 1987; 32:450-455) with 10 mg DNA. 
Twenty-tour hours later, cells were diluted 1:10 into fresh medium containing 200 mg/mL Geneticin. Colonies were 
picked and examined by immunofluorescence for FAP expression as described in Rettig WJ; Garin-Chesa P; Beresford 
HR; Oettgen HF; Melamed MR; Old U. CelNsuiace glycoproteins of human sarcomas: differential expresston in nor- 
mal and n^lignant tissues and cultured cells. Proc Natl Acad Sci USA 1988; 85:31 10-31 14. 
[0184] Immunoprecipitations with cF19 were performed with metabolically labelled cells as described in Rettig WJ. 
Qarin-Chesa P. Healey JH. Su SU Ozer HU Schwab, M. Albino AP, Old U. Regulation and heteromeric structure of the 
fibroblast activation protein in normal and transformed cells of mesenchymal and neuroectodermal origin. Cancer Res 
1993; 53:3327-3335. 

[0185] HT-1080 and 293 cells were tested for FAP antigen expression in immunofluorescence assays with anti-FAP 
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antibodies and were found to be antigen-negative. Transfection of these cells with FAR38 vector resulted In the gener- 
ation of Genetictn-resistant colonies. Isolated colonies were picked and analyzed by immunofluorescence for FAP 
expression. Two cell clones were identified, designated HT-1080FAP clone 33 and 293FAP 1/2, which express cell sur- 
face-bound FAP protein, as recognized by cF19 antibody. Staining of norpemieabilized HT-1080FAP clone 33 cells and 
5 293FAP 1/2 with cF19 antibody confirmed the cell surface localization of the FAP protein. 

[0186] Immunopreclpitation of radiolabelled FAP protein with cF19 from extracts of ^^S-methionine labelled HT- 
1080FAP clone 33 cells or 293FAP 1/2 cells resulted in the appearance of a 93 Idlodalton band after autoradiography. 
This band Is absent In Inmunopredpitates of parental HT-1080 or 293 cell extracts. 

[0187] Two stably transfected cell lines. HT-1080FAP clone 33 and 293FAP 1/2, express FAP on the cell surface as 
10 determined in immunological assays with anti-FAP mAbs. Neither parental HT-1080 cells nor parental 293 cells express 
detectable levels of FAR 

Example 14: Generation and characterization of CD8-FAP fusion protein 

IS [0188] A soluble form of human FAP (fibroblast activation protein) in the form of a CD8-FAP fusion protein was pro- 
duced in insect cells for the characterization of LaHc containing the binding site for antl-FAP mM)s. Murine CDS was 
chosen to permit secretion of the protein and to provide an additional epitope tag. 

[0189] The cONA encoding the extracellular domain of CDS. consisting of the first 189 amino acids of murine CDS, 
was linked to that of the extracellular domain of FAP (amino acids 27 to 760). essentially as described by Lane, et al. 

20 (Lane P, Brock©- T Hubele S, Padovan E. Lazavecchia A. McConnell. Soluble GD40 ligand can replace the normal T 
ceil-derived CD40 ligand signal to B cells in T cell-dependent activation. J Exp Med 1993, 177:1209-1213) using stand- 
ard PGR protocols. The authenticity of all clones was verified by DNA sequencing. The resulting DN A was inserted into 
the pVL1393 vector (Invitrogen) and transfection of Sf9 cells (Invitrogen) with this vector and amplification of the result- 
ing recombinant baculovirus were performed as descrbed (Baculovirus Expression Vectors. A Laboratory Manual. 

25 O'Reilly DR. Miller LK. Luckow VA. (Eds.). Oxford University Press: New York. 1994). The spent medium of High Five'*" 
cells (Invitrogen) infected with recombinant CD8-FAP bacutovlrus for four days was collected and cleared by ultracen- 
trifugation. 

[0190] The CD8-FAP ELISA (enzyme-linked immunosorbent assay) has been described above (Example 12). 
[0191] Insect cell cultures infected with CD8-FAP virus secreted a fusion protein into the medium which carries the 
30 F19 epitope and is recognized by an anti-FAP antibody (Figure 1). Neither the cell culture medium alone nor medium 
■i from insect cells infected with CD8-CD40L fusion protein bound anti-FAP antibody. 

«^ [0192] Soluble CD8-FAP protein carrying the F19 epitope was seaeted into the medium of infected insected cell cul- 
tures. Culture supernatant from cells infected with a control construct did not contain antigen bearing the F19 epitope. 
[01 93] A soluble form of FAR CD8-FAR was produced in insect cells and CD8-FAP was shown to can-y the epitope 
35 recognized by cF19. 

[0194] Supernatants from insect cells infected with recombinant baculovirus encoding either CDS-FAR or GD8-GD40L 
fusion protein were collected four days postinfection. Cell culture medium without cells was used as an additional con- 
trol (medium). Serial dilutions of these materials were added to anti-CDS antibody-coated microliter plates and allowed 
to bind. cF19 (1 mg/mL) was sutjsequently added and allowed to bind. 
40 [01 95] Bound cFI 9 was detected with horseradish peroxidase-oonjugated anti-human antibody. 
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SBQDBRCB LISTING 



(1) GENERAL INFORMATIOK: 

(i) APPLICANT: 

(A) NAME: Boehringer Ingelheim International GmbH 
<E) STRBBT: Rheinatrasse 

(C) CITY: ingelheim am Rhein 

(B) OOOirrRY: Germany 

(F) POSTAL CODE (ZIP) : 55216 

(G) TBLBPHONB: •f*^49-6l32-772770 
<H) TELEFAX: 4-t'49- 6132 -774377 

(ii) TITLE OP INVENTION: FAP alpha- Bpeci£ic antibody vith ii^proved 
produeibi 1 i ty 

(iii> NUMBER OF SBQOBNCES: 101 

(iv) COMPOTBR READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) CQMPOTBR: IBM PC coopatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTNARE: PatentIn Release #1.0, Version ftl.30 (BPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQOBNCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

e (C) STRANDBDNBSS : double 

(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GACATTGT8A TOACCCAATC TCCAGACTCT TTGGCTQTGT CTCTAGGGGA GAGGGCCACC 60 

ATCAACTGCA AGTCCAGTCA GAGCCTTTTA TATTCTAGAA ATCAAAAGAA CTACTTGGCC 120 

35 TGGTATCAOC AOAAACCAGG ACAOCCACCC AAACTCCTCA TCrrTTGGGC TAGCACTAGO 180 

GAATCTGGGG TACCTGATAG GTTCAGTGGC AGTGGGTrrG GGACAGACTT CACCCTCACC 240 

ATTAGCAGCC TGCAGGCTGA AGATGTGGCA GTTTATTACT GTCAGCAATA TTTTAGCTAT 300 

CCGCTCAOGT TOGGACAAGG GACCAAGGTG GAAATAAAA 339 
(2) INFORMATION FOR SEQ ID NO: 2: 



40 



(i) SBQDENCB CKARACTBRISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp He Val Met Thr Gin Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
15 10 15 
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61u Arg Ala Ttir He Asn Cye Lys Ser Ser Oln Ser Leu Leu Tyr Ser 
20 25 30 

Arg Asn Gin Lys Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin 
35 40 45 

Pro Pro Lye Leu Leu He Phe Trp Ala Ser Thr Arg Glu Ser Gly Val 
SO 55 60 

Pro Asp Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp Phe Thr Leu Thr 
65 70 75 80 

Zle Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Tyr Cye Qln Qln 
85 90 95 

Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lye Val Glu He 
100 105 110 

Lys 

(2) INFORMATZON FOR SBQ ID NO: 3: 

(i) SBQORNCB CHARACTERISTICS: 
20 (A) LENGTH: 339 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBS5: double 

(D) TOPOLOGY: linear 



10 



15 



25 



30 



45 



(ii> ISOLBCULB TYPB: CDNA 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 3: 

GACATTGT3A TQACCCAATC TCCAGACTCT TTGGCTGTQT CTCTAOGGQA GAOQGCCACC 60 

ATCAACTGCA AGTCCAGTCA GAGCCTTTTA TATTCTAGAA ATCAAAAGAA CTACTTGGCC 120 

^ TGGTTCCAGC AGAWW:CAGG ACAGCCACCC AAACTCCTCA TCTTTTGGGC TAGCACTAGG 180 

GAATCTGGGG TACCTQATAG GTTCAGTGGC AGTGGGTTTG GGACAGACTT CACCCTCACC 240 

35 A7TAGCAQCC TGCAGOCTGA AGATGTGGCA GTITATGACT GTCAACAATA TTTTAGCTAT 300 

CCGCTCACGT TCGGACAAGG GACCAAGGTG GAAATAAAA 339 

(2) INFORMATION FOR SBQ ID NO: 4: 

<i) SBQDBNCB CHARACTKRISTICS : 
40 (A) LENGTH: 113 amino acids ^ 

(B) TYPB: amino acid 

(C) 3TRANDBDNBSS : single 

(D) TOPOLOGY: linear 



(ii) HOLBCDLB TYPE: peptide 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 4: 

Asp He Val Met Thr Gin Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
50 1 5 10 15 

Glu Arg Ala Thr He Asn Cye Lys Ser Ser Gin Ser Leu Leu Tyr Ser 
20 25 30 
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Arg Asn Oln Lys Asn Tyr Leu Ala Trp Phe Gin Oln Lys Pro Gly Gin 

35 40 45 

Pro Pro Lys Leu Leu He Phe Trp Ala Ser Thr Arg Glu Ser Gly Val 
50 55 60 

Pro Asp Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp Phe Thr Leu Thr 
65 70 75 60 

He Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Asp Cys Gin Oln 
as 90 95 

Tyx Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu lie 
100 105 110 

Lys 



IS (2) INFORMATION FOR SBQ ID NO: 5: 

(i) SEQUKNCK CHARACTBRISTICS : 

(A) LBNGTH: 339 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 



(ii) HOLBCDLB TYPB: CDNA 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 5: 

GACATTGTOA TGACCCAATC TCCAGACTCT TTOGCPGTGT CTCTAGGGGA GAOGGCCACC 60 

ATCAACTGCA AGTCCAGTCA GAGCCITTTA TATTCTAGAA ATCAAAAGAA CXACTTGGCC 120 

TGGTATCAGC AGAAACCASG ACA3CCACCC AAACTCCTCA TCTATPGGGC TAGCACTAGO 180 

30 GAATCTGGG6 TACCTQATAG GTTCAGTQGC AGTGGGTTTG GGACAGACTT CACCCTCACC 240 

ATTAGCAOCC TGCAGGCTGA AGATGTG6CA GTTTATTACT GTCAGCAATA TTTTAOCTAT 300 

CCGCTCACGT TCGGACAAGG GACCAASGTG GAAATAAAA 339 
(2) INFORMATION FOR SBQ ID NO: 6: 



<i) SBQUBNCB CHARACTBRISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPB: amino acid 

<C) STRANDBDNBSS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPB: peptide 



(xi) SBQOENCB DESCRIPTION: SBQ ID NO: 6: 



Asp He Val Met Thr Gin Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
15 10 15 

Glu Arg Ala Ttir He Asn Cys Lys Ser Ser Gin Ser Leu Leu Tyr Ser 
20 25 30 

SO Arg Asn Gin Lys Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin 

35 40 45 

Pro Pro Lys Leu Leu He Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val 
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50 55 60 

Pro Aap Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp Phe itir Leu Ttuc 
65 70 75 ao 

He Ser Ser Leu Gin Ala Olu Asp Val Ala Val Tyr Tyr Cys Gin Gin 

85 90 95 

Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He 
100 105 110 

Lys 



(2) INFORMATION FOR SBQ ID NO: 7: 

(i) SBQOENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDBDNBS3: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 7: 
CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC CGTGAAAGTC 
AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA TACACTGGGT TAGACAGGCC 
CCT6GCCAAA GGCTGGAGTG GATAOGAGGT ATTAATCCTA ACAATGGTAT TCCTAACTAC 
AACCAGAAGT TCAAGGGCCG GGCCACCTTG ACCGTAGGCA AGTCTGCCAG CACCGCCTAC 
ATGGAACTGT CCAGCCTGC6 CrrOCOAGGAC ACTGCAQTCT ACTACTGC6C CAGAAGAAGA 
ATCGCCTATG GTTAOGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG AACCCTTGTC 
ACOGTCTCCT CA 

(2) INFORMATION FOR SBQ ID NO: 8: 

(1) SBQOENCB CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCULB TYPB: peptide 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 

Gin val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr 
20 25 30 

Thr He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 
50 55 60 
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Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

Mec Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyx Tyr Cys 

85 90 95 

Ala Arg Arg Arg lie Ala Tyx Gly Tyr Asp Glu Gly His Ala Met Asp 
100 105 110 

Tyr Trp Gly Gin Gly Thr Leu Val Thx Val Ser Ser 
115 120 

<2) INFORMATION FOR SBQ ID NO: 9: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNBSS: double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: cDNA 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 9: 

CAGGTGCAAC TAGTOCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC CGTGAAAGTC 60 

AGCTGTAAAA CTAGTAGATA CACCTTCACT QAATACACCA TACACTGGGT TAGACAGGCC 120 

25 CCTGGCCAAA GGCTCSGAGTG GATABGAGGT ATTAATCCTA ACAATGGTAT TCCTAACTAC 180 

AACCAGAACT TCAAGGGCCG GGCCACCTTG ACCGTAGGCA AGTCTGCCAG CACCGCCTAC 240 

ATQGAACTGT CCAGCCTGCO CTCOGA0QAC ACTGCAGTCT ACITCTQCOC CAOAAGAAOA 300 

ATCGCCTATQ OTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG AACCCTTGTC 360 

30 

ACCGTCTCCT CA ^''^ 

(2) INFORMATION FOR SBQ ID NO: 10: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 
3S (B) TYPE: amiao acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 

(xi) SBQUBNCB DESCRIPTION: SBQ ID NO: 10: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lye Pro Gly Ala 
15 10 15 

45 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr 
20 25 30 

Thr lie His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 

35 40 45 

SO i Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 

50 55 60 

Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

ss 
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Mst 61u Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Phe Cys 
85 90 95 

Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His Ala Het Asp 

100 105 110 

Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
lis 120 

(2) INFORMATION FOR SBQ 10 NO: II: 

(i) SBQOBNCS CHARACTBRISTICS : 

(A) LENGTH: 372 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOLOGY: linear 

(ii) HDUCOLB TYPB: cDNA 



(xi) SBOfOBNCB DBSCRIPTION: 5EQ ID NO: 11: 

20 CAGGTGCAAC TASTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC OGTGAAAGTC 60 

AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA TACACTGGGT TAGACAGGCC 120 

CCTGQCCAAA GGCTGGAGTO 6ATAQGAGGT ATTAATCCIA ACAATGGTAT TCCTAACTAC IBO 

AACCAGAAQT TCAAGG6CCG GGTCACCATC AC0GTA6ACA CCTCTGCCAa CACC6CCTAC 240 

25 

ATGGAACTGT CCAGCCTGCG CTCCGACSOAC ACTGCAGTCT ACTACTGCGC CAGAAGAAGA 300 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATC3GACTACT GQGGTCAAQG AACCCTTGTC 360 

ACCQTCTCCT CA 372 
30 (2) INFORMATION FOR SBQ ID NO: 12: 

(i) SBgOENCB CHARACTBRISTICS: 

(A) LBNGTO: 124 anino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

35 

(ii) MOLBCOLB TYPE: peptide 



(xi) SBQUENCB DBSCRIPTION: SBQ ID NO: 12: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyx Thr Phe Thr Glu Tyr 
20 25 30 

Thr lie His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Arg Val Thr He Thr Val Asp Thr Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
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85 90 95 

Ala Arg Arg Arg lie Ala Tyr Gly Tyr Asp Glu Gly His Ala Net Asp 
100 105 110 

5 

Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SBQ ID NO: 13: 

(1) SBQUBNCB CHARACTBRISTICS : 
TO (A) LENGTH: 372 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY; linear 



15 
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Hi) M3LBCDLB TYPB: CDKA 



(xi) SBQUBNCB DESCRIPTION: SEQ ID NO: 13: 

CAGQTGCAAC TAGTGCAGTC GG6G6C0SAA GTQAAGAAAC CCGGTGCrrC C6TGAAAGTC 60 

^ AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA TACACTGGGT TAGACAGGCC 120 

CCTGGCCAAA GGCTGGAC5TG GATAGGAGGT ATTAATCCTA ACAATGGTAT TCCTAACTAC 180 

AACCAGAAGT TCAAGGGCCG GGTCACCATC ACCGTAGACA CCTCT6CCA6 CACCGCCTAC 240 

ATGGAACT6T CCAOCCTGOO CTC C GAGGAC ACTGCAGTCT ACXTCTGCGC CAGAAGAAGA 300 

25 

ATCGCCTATQ GTTACX3ACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG AACCCTTGTC 360 

ACCGTCTCCT CA 372 
(2) INFORMATION FOR SBQ ID NO: 14: 

(i) SBQDBHCB CHARACTBRISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOUX5Y: linear 

35 (ii) MOI^COLE TYPB: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

40 Oln Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 

15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr 
20 25 30 

Thx He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
45 3 5 4 0 4 5 

Gly Gly He Asn Pro Asn Asn Gly He Pro Aen Tyr Asn Gin Lys Phe 
50 55 60 



Lys Gly Axg Val Thr He Thr Val Asp Thr Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Phe Cys 
as 90 95 
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20 



35 



40 



45 



Ala Arg Arg Arg He Ala Tyr Gly T/r Asp Glu Qly His Ala Met Asp 
100 105 110 

Tyr Txp Qly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SBQ ID NO: 15: 

(i) SBQDBNCB CKARACTBRISTICS : 

(A) LENGTH: 372 base pairs 

(B) TYPB: nucleic acid 

(C) STRAND80NBSS : double 

(D) TOPOXXXSY: linear 

(ii) MOLBCDLB TYPB: cDNA 



(Xi) SBQDBNCB DBSCRIPTIQN: SBQ ID NO: IS: 

CAGGTGCAAC TAOTOCAGTC CGGCGC06AA GTSAAGAAAC CCGGTGCTTC CGTGAAAGTC 60 

AGCTSTAAAA CTAGTOGATA CACCTTCACT GAATACACCA TACACTOGGT TAGACAGGCC 120 

CCTGGCCAAA GGCTGGAGTG GATAGGAGGT ATTAATCCTA ACAATGGTAT TCCTAACTAC ISO 

AACCAGAAGT TCAAGGGCCG GGTCACCATC ACCGTAGACA CCTCTGCCAG CACCGCCTAC 240 

ATGGAACT6T CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTACTOCGC CAGAAGAAGA 300 

25 ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GQGGTCAAGG AACCCTTGTC 360 

ACCGTCTCCT CA 372 

(2) INFORMATION FOR SBQ ID NO: 16: 

(1) SBQDBNCB CHARACTERISTICS: 
30 (A) XjBNGTH: 124 amino acids 

(B) TYPB: amino acid 
<C) STRANDBDHBSS : single 
(D) TOPOLOGY: linear 



(ii) MOLBCOLB TYPB: peptide 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 16: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr Glu Tyr 
20 25 30 

Thr He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Arg Val Thr He Thr Val Asp Thr Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

so *»1" Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 

85 90 95 

Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His Ala Met Asp 
100 105 110 

55 
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Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
1X5 120 

(2) INFORMATION FOR SBQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPE: peptide 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 17: 



Asp lie val Met Ser Gin Ser Pro Ser Ser Leu Ala Val Ser Val Gly 
15 10 15 

Glu Lye Val Thr Met Ser Cye Lys Ser Ser Gin Ser Leu I*u Tyr Ser 
20 25 30 

Arg Asn Gin Lye Asn Tyr Leu Ala Trp Phe Gin Gin Lys Pro Gly Gin 
35 40 45 

Ser Pro Lys Leu Leu He Phe Trp Ala Ser Thr Arg Glu Ser Gly Val 
50 55 60 

Pro Asp Arg Phe Thr Gly Ser Gly Phe Gly Thr Asp Phe Asn Leu Thr 
65 70 75 80 

He Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Asp Cys Gin Gin 
65 90 95 

Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu 
100 105 110 

Lys Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
115 120 125 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
130 135 140 

35 Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu 

145 ISO 155 160 

Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp 
165 170 175 

Ser Ttir Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr 
40 180 185 190 

Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
195 200 205 



Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: Single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SSQOENCG DESCRIPTION: SEQ ID NO: IB: 

Val Gin Leu Oln OIn Ser Gly Pro Glu Leu Val Lys Pro Gly Ala Ser 
15 10 15 

val Lys Met Ser Cye Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr Thr 
20 25 30 

He His Trp Val Arg Gin Ser His Gly Lys Ser Leu Glu Trp He Gly 
35 40 45 

Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe Lys 
50 55 60 

Oly Arg Ala Thr Leu «ir Val Gly Lys Ser Ser Ser Thr Ala Tyr ifet 
65 70 75 80 

Glu Leu Arg Ser Leu rkiz Ser Glu Asp Ser Ala Val Tyr Phe Cys Ala 
65 90 95 

Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His Ala Mfet Asp Tyr 
100 105 110 

20 Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser Ala Ser Thr Lys Gly 

115 120 125 

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly 
130 135 140 



10 



15 



25 



Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 

145 150 155 160 

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe 
165 170 175 

Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val 
30 180 185 190 

Ttir Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He Cys Asn Val 
195 200 205 



35 



40 



45 



SO 



55 



Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys 
210 215 220 

Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu 
225 230 235 240 

Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr 
245 250 255 

Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp Val 
260 265 270 

Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp Gly Val 
275 280 285 

Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin Tyr Asn Ser 
290 295 300 

Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His Gin Asp Trp Leu 

305 310 315 320 

Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala Leu Pro Ala 
325 330 335 

Pro He Glu Lys Itir He Ser Lys Ala Lys Gly Gin Pro Arg Glu Pro 
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340 345 350 

Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr Lys Asn Gin 
355 360 365 

5 

Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala 
370 375 380 

Val Glu Trp Glu Ser Aan Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr 
385 390 395 400 

10 Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Pho Leu Tyr Ser Lys Leu 

405 410 415 

Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser 
420 425 430 

Val ISet His Glu Ala Leu Hie Aan His Tyr Thr Gin Lys Ser Leu Ser 
IS 435 440 445 

Leu ser Pro Gly Lys 
450 

(2) INFORMATION FOR SEQ ID NO: 19: 

^ (i) SBQUBNCB CMARACTERISTICS : 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDBDNBSS : double 
(D) TOPOLOGY: linear 



(ii) HOLBCOLB TYPE: cDNA 

(xi) SBQGBNCB DESCRIPTION: SBQ ID NO: 19: 

CGTACTGTGG CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGAT6AGCA GTTGAAATCT 60 

GGAACTGCCT CT GT TGTGT G CCTGCTGAAT AACTTCTATC CCAGAGAGGC CAAAGTACA6 120 

TGGAASGTGG ATAACGCCCT CCAATOGGGT AACTCCCAG6 AGAGT6TCAC AQAGCAGGAC 180 

AGCAAGGACA GCACCTACAG CCTCAGCAGC ACCCT6ACGC TGA6CAAAGC AGACTAOGAG 240 

AAACACAAA6 TCTACGCCTG CGAAOTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAA6 300 

AOCTTCAACA OGQaAOAGTO T 321 
(2) INFORMATION FOR SEQ ID NO: 20: 

40 (i) SBQfOBNCE CHARACTBRISTICS : 

(A) LBNGIH: 107 amino acids 

(B) TYPB: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 



25 



30 



35 



45 



50 



55 



(ii) HDLBCOLB TYPB: peptide 



(xi) SBQUENCB DESCRIPTION: SBQ ID NO: 20: 



Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu 
15 10 15 

Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe 
20 25 30 
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Tyr Pro Arg Glu Ala Lys Val Gin 
35 40 

5 Ser Gly Asn Ser Gin Glu Ser Val 

50 55 

Thr Tyr Ser Leu Ser Ser Thr Leu 
65 70 

Lys His Lys Val Tyr Ala Cys Glu 
10 85 

Pro Val Ttir Lys Ser Phe Asn Arg 
100 

(2) INFORMATION FOR SBQ ID NO: 21: 

IS (i) SBQUBHCB CHARACTERISTICS: 

(A) LBMGTH: 990 base pairs 
<B) TYPB: nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: CDRA 

20 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 21: 



25 


6CCTCCACCA 


AGGGCCCATC 


GGTCTTCCCC 


CTGGCACCCT CCTCCAAQAG CACCTCTGGG 


60 


GGCACAGCGO 


CCCTGGGCTG 


CCTGGTCAA6 


GACTACrrcC 


CCGAACCGGT GAGGGT6TCG 


120 




TGGAACTCA8 


GCGCCCTGAC 


CAGCGGCGTG 


CACACCTTCC 


CGGCTGTCCT ACA6TCCTCA 


180 




GGACTCTACT 


CCCTCAGCAG 


CGTGGTGACC 


GTGCCCTCCA 


GCAGCTTGGG CACCCAGACC 


240 


30 


TACATCTGCA 


ACGTGAATCA 


CAAGCCCAGC 


AACACCAAGG 


TGGACAAGAA AGTTGAGCCC 


300 




AAATCTTGT6 


ACAAAACTCA 


CACAT6CCCA 


CCGTGCCCAG 


CACCTGAACT CCTGGGGGGA 


360 




CCGTCAGTCT 




CCCAAAACCC 


AAGGACACCC 


TCATGATCTC CCGGACCCCT 


420 


35 


GAGGTCACAT 


GCGTGGTGGT 


GGACGTGAGC 


CACGAAGACC 


CTGAGGTCAA GTTCAACTGG 


460 




TACGTSGACG 


GCGTQQAOGT 


GCATAATGCC 


AAGACAAAOC 


CGCOGGAGGA GCAGTACAAC 


540 




A6CACGTACC 


GGGTGGTCA6 


CGTCCTCACC 


GTCCTGCACC AGGACTGGCT GAATGGCAAG 


600 




GAGTACAA6T 


GCAAGGTCTC 


CAACAAAGCC 


CrCCCAGCCC 


CCATCGAGAA AACCATCTCC 


660 


40 


AAAGCCAAAG 


GGCAGCCCCG 


AGAACCACAG 


GTGTACACCC 


TGCCCCCATC CCGGGAGGAG 


720 




ATGACCAAGA 


ACCAGGTCAG 


CCTGACCTGC 


CTGGTCAAAG 


GCTTCTATCC CAGCGACATC 


780 




GCCGTGGA6T 


GGGAGAGCAA 


TGGGCAGCCG 


GAGAACAACT ACAAGACCAC GCCTCCCGTG 


840 


45 


CTGGACTCCG 


ACGGCTCCTT 


CTTCCTCTAC 


A6CAAGCTCA CCGTGGACAA GAGCAGGT6G 


900 




CAGCAGGGGA 


ACffrcrrcrc 


AT6CTCCGT6 


ATGCATGAGG 


CTCTGCACAA CCACTACAC6 


960 




CAGAAGAGCC 




TCCGGGTAAA 






990 



(2) INFORMATION FOR SBQ ID NO: 22: 

50 

(i) SEQDBNCB CKARACTBRISTICS : 

(A) LENGTH: 330 amino acids 

(B) T7PB: amino acid 



Trp Lys Val Asp Asn Ala Leu Gin 
45 

Thr Glu Gin Asp Ser Lya Asp Ser 
60 

Thr Leu Ser Lye Ala Asp Tyr Glu 

75 ao 

Val Thr His Gin Gly Leu Ser Ser 
90 95 

Gly Glu Cys 
105 
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15 



20 



iC) STRANDBDHBS5: single 
(D) TOPOLOGY: linear 

(ii) MOLBCDLB lYPB: peptide 



(xi) SBgOENCB DBSCRIPTION: SEQ ID NO: 22.: 

Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys 

^0 1 5 10 15 

Ser rtir Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr 
20 25 30 

Phe Pro Glu Pro Val Thr Val Ser Trp Aen Ser Gly Ala Leu Thr Ser 
35 40 45 

Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser 
50 55 €0 

Leu Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr 
65 70 75 80 

Tyr He Cys Aan Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lye 
85 90 95 

Lys Val Glu pro Lye Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys 
100 105 110 

Pro Ala Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro 
115 120 125 

Lys Pro Lys Asp Thr Leu Met He Ser Arg Ttir Pro Glu Val Thr Cys 
130 135 140 

Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
145 150 155 160 

Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 
165 170 175 

Glu Gin Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 
180 1S5 190 

His Gin Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 

195 200 205 

Lys Ala Leu Pro Ala Pro He Glu Lys Thr He Ser Lys Ala Lys Gly 
210 215 220 

40 Gin Pro Arg Glu Pro Gin Val Tyr Ttir Leu Pro Pro Ser Arg Glu Glu 

225 230 235 240 

Met Thr Lys Asn Gin Val Ser Leu Thr Cys Leu Val Lye Gly Phe Tyr 
245 250 255 

Pro Ser Asp He Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn 
^ 260 265 270 

Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
275 280 285 

Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn 
290 295 300 



30 



35 



50 



55 



Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
305 310 315 320 
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Gin Lys Ser Leu Ser Leu Ser Pro Gly Lys 
325 330 

(2) INFORMATION PGR SBQ ID NO: 23: 

<i) SBQUBN CB CH ARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPB: nucleic acid 

(C) STRAMDBDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPE: DNA <gencnic) 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 23: 

AASCTIQCCG CCACCATOGA TTCACAGGCC CAGGTTCTTA TOTTACPQCC GCTATGGQTA 60 

TCTGGTACCT GTGGGGACAT TGTGATGTCA CAGTCTCCAT CCTCCCTAGC TGTGTCAGTT 120 

GGAGAGAAGG TTACTATQAS CTGCAAGTCC AGTCAGAGCC TTTTATATAG TCGTAATCAA 180 

20 AAOAACTACT TGGCCTGGTT CCAGCAGAAG CCAGGGCAGT CTCCTAAACT GCTGATTTTC 240 

TGGGCATCCA CTAGGGAATC TGGGGTCCCT GATCGCTTCA CAGGCAGTGG ATTTGGGACG 300 

GATTTCAATC TCACCATCAG CAGTGTQCAG GCTGAGGACC TG6CAGTTTA TQACTGTCAG 360 

CAATATTTTA GCTATCC6CT CACGITCGGT GCTGGGACCA AGCTGGAGCT GAAACGTGAG 420 

25 

TGGATCC 427 

(2) INFORMATION FOR SBQ ID NO: 24: 

(i) SBQUBNCB CKARACTBRISTICS : 

(A) LBNGTH: 133 amino acids 
3^ (B) TyPB: amino acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLSCULB TYPB: peptide 

35 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 24: 

Met Asp Ser Gin Ala Gin Val Leu Met Leu Leu Pro Leu Trp Val Ser 
1 5 10 15 

40 

Gly Thr Cys Gly Asp He Val Met Ser Gin Ser Pro Ser Ser Leu Ala 
20 25 30 

Val Ser Val Gly Glu Lys Val Thr Met Ser Cys Lys Ser Ser Gin Ser 
35 40 45 

45 Leu Leu Tyr Ser Arg Asn Gin Lys Asn Tyr Leu Ala Trp Phe Gin Gin 

SO 55 60 

Lys Pro Gly Gin Ser Pro Lys Leu Leu lie Phe Trp Ala Ser Thr Arg 
65 70 75 80 

Glu Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Phe Gly Thr Asp 
50 85 90 95 

Phe Asn Leu Thr lie Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr 
100 105 110 
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Abp Cvb Gin Gin Tyr Phe Ser Tyr Pro Leu Hir Phe Gly Ala Gly Thr 
115 120 125 

Lys h&ix Glu Leu Lye 
130 

(2) INFORMATION FOR SBQ ID NO: 25: 

(i) SBQOBNCB CKARACTBRISTICS : 

(A) LENGTH: 457 base pairs 

(B) TYPE: nucleic acid 

(C) STMMDBDNBSS: double 

(D) TOPOLOGY: linear 

(ii) M0LBCDI2 TYPE: DNA (genomic) 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 25: 
AAGCTTGCCG CCACCATGGQ ATGGAGCTOO GTCTTTCTCT TTCTCCTGTC AGGAACTGCA 60 
GGTGTCCTCT CTGAGGTCCA GCTGCAACAG TCTGGACCTG AGCTGGTGAA GCCTGGGGCT 120 

^ TCA(3TAAA3A TGTCCTQCAA GACITCTAaA TACACATTCA CTQAATACAC CATACACTQQ 180 

GTGAGACAGA GCCATGGAAA GAGCCTTGAG TGGATTGGAG GTATTAATCC TAACAATGGT 240 
ATTCCTAACT ACAACCAGAA GTTCAAGGGC AGGGCCACAT TGACTC3TAGG CAAGTCCTCC 300 

25 AGCACCGCCT ACATGOABCT CCGCAGCCTG ACATCTGAGG ATTCTQCGGT CTAnTCTGT 360 

GCAAGAAGAA GAATCGCCTA TGGTTACX5AC GAGGGCCATG CTATGGACTA CTGGGGTCAA 420 
GGAACCTCAG TCACCGTCTC CTCAGGTOAG TGGATCC 457 
(2) INFORMATION FOR SBQ ID NO: 26: 

30 

(i) SBQOENCB CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLBCDLE TYPE: peptide 



(xi) SEQOENCB DESCRIPTION: SBQ ID NO: 26: 

40 Met Gly Trp Ser Trp Val Phe Leu Phe Leu Leu Ser Gly Ttir Ala Gly 

15 10 15 

val Leu Ser Glu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys 
20 25 30 

Pro Gly Ala Ser Val Lys Met Ser Cys Lys Thr Ser Arg Tyr Thr Phe 

45 35 40 45 

Thr Glu Tyr Thr He His Trp Val Arg Gin Ser His Gly Lys Ser Leu 
50 55 60 



Glu Trp He Gly Gly He Asn Pro Asn Aen Gly He Pro Asn Tyr Asn 
65 70 75 80 

Gin Lye Phe Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ser Ser 
85 90 95 
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Thr Ala Tyr Ifec Glu Leu Arg Ser Leu Tlir Ser Glu Asp Ser Ala Val 
100 105 110 

Tyr phe Cys Ala Arg Arg Arg lie Ala Tyr Gly Tyr Asp Glu Gly His 
115 120 125 

Ala Met Asp Tyr Trp Gly Gin Gly Thr Ser Val Thr Val Ser Ser 
130 135 140 

(2) INPORIATION FOR SBQ ID W: 27: 

(i) SBQUBKCB CHARACTERISTICS: 

(A) LENGTH: 8068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOL0G7: linear 

<ii) MOLBCOLB TYPB: DKA (genomic) 



(xi) SBQUBNCB DESCRIPTION: SEQ ID MO: 27: 



20 


GAATTCCA9C 


ACACTGGCGG CCGTIACTAO TTATTAATAG TAATCAATTA 


CGGGGTCATT 


60 




AGTTCATAGC 


CCATATATGG AGTTCCGCGT TACATAACTT ACGGTAAAT6 


GCCCGCCTGG 


120 




CTGACCOCCC 


AACGACCCCC GCCCATTQAC GTCAATAATG ACGTATGTTC 


CCATA6TAAC 


xeo 


25 


GCCAATAGGG 


AcrrrccATT gaoqtcaatg gqtgqagtat ttacggtaaa 


CTGCCCACTT 


240 


GGCAGTACAT 


CAAC?rGTATC ATATGCCAAG TACGCCCCCT ATTGACGTCA 


AT6ACGGTAA 


300 




ATGGCCCGCC 


TGGCATTATG CCCAGTACAT GACCTTATGG gactttccta 


CTTGGCAGTA 


360 




CATCTACGTA 


TTAGTCATCG CTATTACCAT GQTGATGCGG nrrGGCAGT 


ACATCAATGG 


420 


30 


GCGTGGATAG 


CGGTTTGACT CACGGGGATT TCCAAGTCTC CACCCCATTG 


ACGTCAATGG 


480 




GAtfiTTGrrr 


TGGCACCAAA ATCAAOGGGA CTTTCCAAAA T6TCGTAACA 


ACTCGGCCCC 


540 




ATT6ACGCAA 


ATGGQCGGTA GGC6TGTAC6 GTGGGAQGTC TATATAABCA 


GAGClXXrriT 


600 


35 


ABTGAACCGT 


CA&ATCGCCT GGAGACGCCA TCCACGCTGT TTTQACCTCC 


ATA6AAGACA 


660 


CCGGG3UXX^ 


TCCAGCCTCC GCGGCCGGGA ACGGTGCATT GGAACGCGGA 


nCCCCGTQC 


720 




CAAGAGTGAC 


GTAAGTACOG CCTATAGAGT CTATAQGCCC ACCCCCrTGG 


CTTCTTAT6C 


700 




ATGCTATACT 


GnTTTGGCT TGGGGTCTAT ACACCCCCGC TTCCTCATGT 


TATAGGTGAT 


840 


40 


GGTATAGCTT 


AGCCTATAGG TGTGGGTTAT TGACCATTAT TGACCACTCC 


CCTATTGGTG 


900 




ACGATACTTT 


CCATTACTAA TCCATAACAT GGCTCTTTGC CACAACTCTC 


TTTATTGGCT 


960 




ATATGCCAAT 


ACACTGTCCT TCAGAGACT6 ACACGGACTC TGTATTTTTA 


CAGGATGGGG 


1020 


45 


TCTCATTTAT 


TATTTACAAA TTCACATATA CAACACCACC GTCCCCAGTG 


CCCGCAGTTT 


lOSO 




TTATTAAACA 


TAACGTGGGA TCTCCACGCG AATCTCGGGT ACGTGTTCCG 


GACATGGGCT 


1140 




CTTCTCCGGT 


AGOGGCGGiWG COTCTACATC CGAGCCCTQC TCCCAT6CCT 


CCA6CGACTC 


1200 




ATGGTCGCTC 


GGCAGCTCCT TGCTCCTAAC AGTGGAGGCC AGACTTAGGC 


ACAGCACGAT 


1260 


SO 


GCCCACCACC 


ACCAGTGTGC CGCACAAGGC CGTGGCGGTA G0GTATGT6T 


CTSAAAATGA 


1320 




GCTCGGGGAG 


CGGGCTTGCA CCGCTGACGC ATTTGGAAGA CTTAAGGCAG 


CGGCAGAAGA 


1380 
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40 



45 



50 



AC3A!K3CAG6C AGCTOAOITO 'I TC T G TTCTG ATAAGAGTCA GAGGTAACTC CCGTTOCGGT 1440 

GCTCTTAACG GTCGAGGGCA GTGTW3TCTC AGCAGTACTC GTTGCTGCCG CGCGCGCCAC 1500 

CAGACATAAT AGCTGACAGA CTAACAGACT GTTCCTrrCC ATGGGTCTTT TCTOCAGTCA 1560 

CCGTCCTTGA CACGCGTCTC GGOAAGCTTQ CCX3CCACCAT GOATTCACAG GCCCAGC5TTC 1620 

rrAOXSTTACT GCCQCTATGG GTATCTGOTA CCTGTGGGGA CMTCTOATO TCACAOTCTC 1680 

CATCCTCCCT AGCTQTQTCA GTTGGAGAfiA AGGTTACTAT GAGCTGCAAG TCCAGTCAGA 1740 

GCCTTITATA TTCTAGAAAT CAAAAGAACT ACTTGGCCTG GTTCCAOCAO AA8CCAGOOC 1800 

AGTCTCCTAA ACTGCTGATT TTCTGGGCAT CCACTAGGOA ATCTGGGGTC CCTGATCGCT 1860 

TCACAGOCAO TGGATITaaa AOaOATTTCA ATCTCACCAT CAGCAGTOTQ CaGGCTGAGG 1920 

ACXrrGGCAGT TTATGACTGT CAGCAATATT TTAGCTATCC GCTCACGTTC GGTGCTGGGA 1980 

CCAAGCTGGA GCTGAAACGT GAGTGGATCC ATCTGGGATA AGCATGCTGT TTTCTGTCTG 2040 

TCCCTAACAT GCCCTOTGAT TATGCGCAAA CAACACACCC AAGGGCAGAA CTTTGTTACT 2100 

TAAACACCAT CCTGTTTGCT TCTTTCCTCA GGAACTGTGG CTGCACCATC TGTCTTCATC 2160 

TTCCCGCCAT CIGATGAGCA GTTGAAATCT GGAACTGCCT CTGTTGTGTG CCTGCTGAAT 2220 

AACTECTATC GCAGAOAGGC CAAA6TACA0 TGQAAdGTGG ATAACGCCCT CCAAICGOGT 2280 

AACTCCCAGG AGAtHCTGAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 2340 

ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAA6 TCTACGCCTG CGAAGTCACC 2400 

CATCAGGGCC TGAGCTOGCC OGTCACAAAG A6CTTCAACA GGGGAGAGTG TTAGAGGGAG 2460 
AAGTCCCCCC ACCTGCTCCT CAGTTCCAGC CTGACCCCCT CCCATCCTTT GGCCTCTGAC 2520 
CCrnrrCCA CAGGGGACCT ACCCCTATTG CGGTCCTCCA GCTCATCTTT CACCTCACCC 2580 
CCCTCCTCCT CCTTOGCTTT AATTATGCTA ATGTTGGAGG AGAATGAATA AATAAAGTGA 2640 
ATCTTTGCAC CTGTGGTQGA TCTAATAAAA GATATTTATT TTCATTAGAT ATGTGTGTTG 27O0 
GTrmTGTG TCCAGTGCCT CTATCTGGAG GCCAGCTAGG GCTGGCCTPG GGGGAGGGGG 2760 
AGGCCAGAAT GACTCCAAGIA GCTACAGGAA GOCAGGTCAG AGACCCCACT GOACAAACAG 2820 
TGGCTGGACT CTGCACCATA ACACACAATC AACAGGGGAG TGftGCTGGAA ATTPSCTAGC 2880 
GAATTCTTGA AGACGAAAGG GCCTCGTGAT ACGCCTATTT TTATAGGTTA ATGTCATGAT 2940 
AATAATCGTT TCTTAGAOGT CAGGTGGCAC TTTPCGGGGA AATGT6CG06 GAACCCCTAT 3000 
TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT AACCCTGATA 3060 
AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT CAACATTTCX: GTGTCGCCCT 3120 
TATTCCCTTT TTTGCGGCAT TTTGCCTTCC TGTTTITGCT CACCCAGAAA CGCTGGTGAA 3180 
AGTAAAAGAT GCTGAAGATC AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA 3240 
CAGCGGTAAG ATCCTTGAGA GTmCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT 3300 
TAAAGTTCTG CTATGTGOCG CGGTATTATC CCGTGTTGAC GCCXXiGCAAG AGCAACTOGG 3360 
TCGCCGCATA CACTATTCTC AGAATGACTT GGTTQAGTAC TCACCAGTCA CAGAAAAGCA 3420 
TCTTACGGAT GGCATGACAG TAAGAGAATT ATGCAGTOCT QCCATAACCA TGAGTGATAA 3480 
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CACTQCGGCC AACTTACTTC TGACAACGAT CGGAGGACCG AAGGAGCTAA CCGCTTmT 3540 

GCACAACATG GGGOATCATG TAACTCOCCT TGATCGTTGG GAACCGGA6C TGAATGAAOC 3600 

CATACCAAAC GACGA6CGTG ACACCACGAT GCCTGCAGCA ATGGCAACAA C6TTGCGCAA 3660 

ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA CAATTAATAG ACTGGATGGA 3720 

GGCGGATAAA 6TTGCAGGAC CACTTCTGCX3 CTCGGCCCTT CCGGCTCSGCT GGTTTATTGC 3780 

TGATAAATCT GOAGGCGGTO AOCGTGGGTC TCGOQGTATC ATT6CAGCAC TGGGGCCAGA 3840 

TG6TAA0CCC TCTCGTATOG TAOTTATCTA CACGAC6GGG AGTCAG6CAA CTTATOGATQA 3900 

ACGAAATAGA CAGATCGCTG AGATAGGTOC CTCACTGATT AAGCATTGGT AACTGTCAGA 3960 

CX!AAGTrTAC TCATATATAC TITAGATTGA TTTAAAACTT CATTITTAAT TTAAAAGGAT 4020 

CTAGGTGAAG ATCCmTTG ATAATCTCAT GACCAAAATC CCTTAACGTG AGTTTTCGTT 4080 

CCACTGAGCG TCAGACCCCG TAGAAAAGAT CAAAGGATCT TCTTGAGATC CTTTTTTTCT 4140 

GCGCGTAATC TGCTGCTTGC AAACAAAAAA ACCACCGCTA CXyiGCGGTGG TTTGTTTGCC 4200 

20 GQATCAAGAO CTACCAACTC TmTCCXtAA GGTAACTGGC ITCAGCAGAa CQCAjQATACC 4260 

AAATACTGTC CTTCTAGIXjT AGCCGTAGTT AGGCCACCAC TTCAAGAACT CTCTTAGCACC 4320 

GCCTACATAC CTC6CTCTGC TAATCCTGTT ACCA6TGGCT GCXXKXAGTG GCGATAAGTC 4380 

6TGTCTTACC GGGTIGGACr CAA6A0GATA GTrACCXSGAT AAGGCGCAGC GGT0G06CTG 4440 

25 

AACGGGGG6T TOOTGCACAC AOCCCAGCTT GGAGCOAACG ACCTACACCX3 AACTOAGATA 4500 

CCTACAGCGT GAGCTAT6AG AAAGCXKTCAC GCTTCCCGAA GGGAGAAAGO CGGACAGGTA 4560 

TCCGGTAA6C GGCAGGGTCG GAACAGGAGA GCGCACGAGG GAGCTTCCAG GGGGAAACGC 4 620 

30* CTGGTATCTT TATAGTCCTG TCGGGTTTCG CCACCTCTGA CTTGAGCGTC GATTTTTC?rG 4680 

ATGCTCGTCA GGGGGGCGGA GCCTATGGAA AAACGCCAGC AACGCGGCCT TrTTACGGTr 4740 

CCTGGCCTTT TGCTGGCCTT TTGCTCACAT b ' ri ' Cm ' CCI ' GCGTTATCCC CTGATTCTGT 4800 

GGATAACCGT ATTAOCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA 4860 

GOGCAGCGAG TCAGTGAGCG AGOAAGCGGA AGAGC6CCTG ATGOGGTATT TTCTCCTTAC 4920 

GCATCTGTGC GGTATTrCAC ACC6CATATG GTGCACTCTC AGTACAATCT GCTCTGATQC 4980 

C!GCATAGTTA AGCCAGTATA CACTCCGCTA TCGCTACGTG ACT6GGTCAT GGCTGCGCCC 5040 

CXSACACCCGC CAACACCCGC TGACGCGCCC TGACGG6CTT GTCTGCTCCC GGCATCCGCT 5100 

TACAGACAA6 CTOTGACCGT CTCOSGGAGC TGCATGTGTC AGAGGnTTC ACCGTCATCA 5160 

CCGAAACGCX5 CX3AGGCAGCT GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC 5220 

45 TCCCCAGCAG GCAGAAGTAT GCAAAGCATG CATCTCAATT AGTCAGCAAC CAGGCTCCCC 5280 

AGCAGGCAGA AGTAT6CAAA GCATGCATCT CAATTAGTCA GCAACCATAG TCCC6CCCCT 5340 

AACTCCGCOC ATCXXGCXCC TAACTCCGCC CAGTTCC6CC CATTCTCJCGC CCCATGGCTG 5400 

ACTAATmr TTTATTTATO CAGAGOCCGA GGCCX3CCTCG GCCTCTGAGC TATTCCAGAA 5460 

SO 

GTAGTGfiGGA GGCTITIYW OAGGCCTAGG CTTITGCAAA AAOCTAGCTT CACGCTGCCG 5520 

CAAGCACTCA GGGCGCAAGG GCT6CTAAAG GAAGCGGAAC AC6TAGAAAG CCAGTCC6CA 5580 
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6AAACGGTGC TQACCCCGCai TOAATOTCAG CTACTGGGCT ATCTGGACAA GGGAAAACX5C 5640 

AAGCGCAAAfi ASAAAGCAGG TAGCTTGCAG TGGGCTTACA TGGCGATAGC TAGACTGGGC 5700 

GGTTTTATGG ACAGCAAGCG AACCGGAATT GCCAGCTGGG GCGCCCTCTG GTAAGGTTGG 5760 

GAAOCCCTOC AAAOTAAACT GGATGGCTTT CTTGCC6CCA AGGATCTGAT GGCGCAGGGG 5820 

ATCAAGATCT GATCAAGAGA CAGGATGAGO ATGGTTTCGC ATSATIGAAC AAGATGGATT 5880 

GCACGCAGGT TCTC0GGCCX3 CTTGGGTGGA GAGGCTATTC GQCTATGACT GQ6CACAACA 5940 

GACAATOGGC TGCTCTGAT6 CCGCOGTOTT CCGGCTQTCA GCGCAGGGGC GCCOGGTTCT 6000 

TTTTGTCAAG ACCGACCTGT COGGTGCCCT GAATGAACTG CA6GACGAG6 CAGCGCGGCT 6060 

ATCGTQGCTQ GCCACGACGG GCGTrCCTTG CGCAGCTGTG CTCGACGTTG TCACTGAAGC 6130 

GGGAAGGGAC TGGCTGCTAT TGOGCXIAAGT GCCGGGGCAG GATCTCCTGT CATCTCACCT 6180 

TCCTCCTCCC GAGAAAGTAT CCATCATGGC TGATGCAATG CGGCGGCTGC ATACGCITQA 6240 

TCCGGCTACC TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCQAG CACGTACTCG 6300 

GATGGAAGCC GGTCrTGTCG ATCAGGATGA TCTGGACGAA GAOCATCAGG GGCTC6CGCC 6360 

A6CCGAACTQ TTOGCCAGGC TCAAGQCGCG CATGCCC6AC GGCGAGGATC TCOTCGTGAC 6420 

CCATGGCGAT GCCTGCTTGC OGAATATCAT GGTQGAAAAT GGCXXSCmT CTGGATTCAT 6480 

CGACTGTGGC CGGCTGGGTG TOGCXaOACOQ CTATCAGGAC ATAGGGTTGG CTACCCX3TGA 6540 

TATTGCTGAA QAGCTTGGCG OGQAATQGGC TGACCGCTTC CTCOTGCTTT ACGGTATCGC 6600 

CGCTCCCGAT TCGCAGCGCA TCGCXTTTCTA TCGCCTTCTT GACGAGTTCT TCTGAGCGGG 6660 

ACTCTGGGGT TCGAAATGAC CGACCAAGC6 ACGCCCAACC TGCCATCACG AGATTTCGAT 6720 

7CCACCX5CCG CCTTCTATGA AAGGTTGG6C TTCGGAATCG TnTCCGGGA CGCCQGCTGG 6780 

ATGATCCTCC AGCGCGGGGA TCTCATGCTG GAGTTCTTCG CCCACCCCXK5 GCTCGATCCC 6840 

CTCGCGAOTT GGTTCAGCTG CTQCCTGAGG CTGGACGACC TCGCX3GAGTT CTACXX3GCAG 6900 

TGCAAATCCG TCGGCATCCA GGAAACCAGC AGCGGCTATC CGCGCATCCA TGCCGCCGAA 6960 

CTGCAGGAGT GGGGAGGCAC GATGGCC6CT TTGGTCCCGG ATCTTTGTGA AGGAACCTPA 7020 

CTTCTGTQGT GTGACATAAT TGGACAAACT ACTTACAGAG ATTEAAAGCT CTAAGGTAAA 7080 

TATAAAATTT TTAAGTGTAT AATGTGTTAA ACTACTGATT CTAATPGTTT GTGTATTTTA 7140 

GATTCCAACC TATGGAACT6 ATGAAT6GGA GCAGTGGTGG AATGCCXTTA ATGAGGAAAA 7200 

CCTGTITTGC TCAGAAGAAA TGCCATCTAG TGATGATGAG GCTACTGCTG ACTCTCAACA 7260 

TTCTACTCCT CCAAAAAAGA AGAGAAAGGT AGAAGACCCC AAGGACTTTC CTTCAGAATT 7320 
GCTAAGTTTT TTGAGTCATG CTGTGTTTAG TAATAGAACT CTTGCTTGCT TTGCTATTTA 7380 
CACCACAAAG GAAAAAGCTG CACTGCTATA CAAGAAAATT ATGGAAAAAT ATTCTGTAAC 7440 
CTTTATAAOT AGGCATAACA GTTATAATCA TAACATACItS TTTTTTCTTA CTCCACACAG 7500 
6CATAGAGT6 TCTGCTATTA ATAACTATGC TCAAAAATTG TGTACCTTTA GCTTTTTAAT 7560 
TTGTAAAGGG GTTAATAAGG AATATTTGAT GTATAGT6CC TTGACTAGAG ATCATAATCA 7620 
QCCATACCAC AITTGTAGAG GTTTTACTTO CTTTAAAAAA CCICCCACAC CTCCCCCTGA 7680 
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ACCTGAAACA TAAAATGAAT GCAATTOTTG TT6TTAACTT GTTTATPGCA GCTTATAATG 7740 

GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT TCACT6CA1T 7 BOO 

CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA TGTCTGGATC TAATAAAASA 7860 

TATITATTXT CATTAGATAT GTGTGTTGGT rr m VIV i i; CAGTGCCTCT ATCTGGAGGC 7920 

CAQGTAGG6C TCSGCCTTGGG G6AGGGGGA0 GCXAGAATGSA CTCCAAGA6C TACSUSQAAGG 7960 

CAGGTCAGA6 ACCCCACTOG ACAAACAGTG GCrGGACTCT GGACCATAAC ACACAATCAA 8040 

CAGGGGAGTQ AGCTGGAAAT TTGCTAGC fi068 
(2) INFORmiTION FOR SBQ ID NO: 28: 

(1) SBQOENCE CKARACTBRISTICS: 

(A) ZiBNGTH: 239 amino acids 

(B) TYPE: amino acid 

(C) STRAHDBDHESS: single 

(D) TOPOliOGy: linear 

(ii) MOLECULB TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 28: 

Asp Ser Gin Ala Gin Val Leu Met Leu Leu Pro Leu Trp Val Ser Gly 
IS 10 15 

Ihr Cys Gly Asp He Val Met Ser Gin Ser Pro Ser Ser Leu Ala Val 
20 25 30 

Ser Val Gly Glu Lys Val Thr Met Ser Cys Lys Ser Ser Gin Ser Leu 
35 40 45 

30 Ser Arg Asn Gin Lys Asn Tyr Leu Ala Trp Pho Gin Gin Lya 

50 55 60 

^ Pro Gly Gin Ser Pro Lys Leu Leu He Phe Trp Ala Ser Tlir Arg Glu 

65 70 75 80 

Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Phe Gly thr Asp Phe 
3S 85 90 95 

Asn Leu Thr He Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Asp 
100 105 110 



10 



IS 



20 



25 



40 



Cys Gin Gin Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys 
115 120 125 

Leu Glu Leu Lys Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro 
130 135 140 

Pro Ser Asp Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu 
145 150 155 160 

^ Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp 

165 170 175 

Asn Ala Leu Gin Ser Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp 
180 185 190 



SO 



55 



Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys 

195 200 205 

Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin 
210 215 220 
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Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly GXu Cye 
32S 230 235 

(2) INFOKHATIOR FOR SBQ ID NO: 29: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LEISIGTH: 7731 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDMBSS: double 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: DNA (genomic) 



(xi) SBQOSNCB DBSCRIPTION: SBQ ID NO: 29: 
TTQAAGACGA AAG6GCCTCQ TOATACGCCT ATTTTTATAG GTTAATGTCA TOATAATAAT 
GGTTTCTTAG ACGTCAGGTO GCACTTTTCG GGGAAATGTG CGCGGAACCC CTATTrGTIT 
ATrmCTAA ATACATTCAA ATATGTATCC GCTCATGAGA CAATAACCCT GATAAATGCT 
TCAATAATAT TGAAAAAGGA AGAGTATGAG TATTCAACAT TTCCGTGTCG CCCITATTCC 
CTTTITTGCG GCATITPGCC TTCCTGTnT TGCTCACCCA GAAACGCTGG TGAAAOTAAA 
AGATGCTGAA GATCAGTTGG GTGCACGAGT GGGTTACATC GAACTGGATC TCAACAGCGG 
TAA6ATCCTT GAGAGTTTTC GCCCCGAAGA ACOnTTCCA ATGATGAGCA CTTTTAAAGT 
TCTQCTATOT GQGGCGOTAT TATCCCGT3T TQACGCOOOO CAAGAOCAAC TCOGTCGCCQ 
CATACACTAT TCTCAGAATG ACTTGGrTGA GTACTCACCA GTCACAGAAA AGCATCTTAC 
GGATGGCATG ACAGTAAGAG AATTATGCAG TGCTGCCATA ACCATGAGTG ATAACACTGC 
GGCCAACTTA CTTCTGACAA CGATCGGAGG ACCGAAGGAG CTAACCGCTT mTGCACAA 
CATGGGGGAT CATGTAACTC GCCTTGATCG TTGGGAACCG GAGCTGAATG AAGCCATACC 
AAACGACGAG CGTGACACCA CQATGCCTGC AGCAATGGCA ACAACGTTGC GCAAACTATT 
AACTGGCGAA CTACTTACTC TAGCTTCCCG GCAACAATTA ATAGACTGGA TGGAGGCGGA 

TAAACrrreCA ggaccacttc tgcgctcggc ccttccggct ggctggttta ttgctgataa 

ATCTGGAGCC GGTGAGOGTG GGTCTCGCGG TATCATTGCA 6CACTGGGGC CAGATGGTAA 
GCCCTCCCGT ATOGTACmA TCTACACGAC GGGGAGTCAO GCAACTAT6G ATGAACGAAA 
TAGACAGATC GCTGA6ATAG GTGCCTCACT GATTAAGCAT TGGTAACTGT CAGACCAAGT 
TTACTCATAT ATACTTTAGA TTGATTTAAA ACTTCATTTT TAATTTAAAA GQATCTAGGT 
GAA6ATCCTT TTTOATAATC TCATGACCAA AATCCCTTAA CGTGAGTTTT CGTTCCACTG 
AGCGTCAGAC CCC6TAGAAA AGATCAAAGG ATCTTCTTGA GATCCmTT TTCTGCGCGT 
AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG GTGGTTrGTT TGCCGGATCA 
AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC AGAGCGCAGA TACCAAATAC 
TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG AACTCTGTAG CACCGCCTAC 
ATACCTOGCT CTGCTAATCC TGTTACCAGT GGCTGCTOCC AGTGGCGATA AGTCGTGTCT 
TACCGGGTTG GACTCAAGAC GATAGTTACC QGATAAOGCO CAGCGGTCGG GCTGAACOGO 
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G0G1TC6T6C ACACA6CCCA GCTIGGA6CG AAC6ACCTAC ACCGAACTGA GATACCTACA 1620 

GCGTX5AGCTA TGAGAAAQCXS CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT 1660 

AAGCXX3CAGQ GTCGGAACAQ GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA 1740 

TCTTTATAGT CCTGTCGGGT TTOGCCACCT CTGACTOAG CGTCGATTTT TGTGATCCTC 1800 

GTCAGGGGGG CGGAGCCTAT CXSAAAAACGC CAOCAACGCG GCCTTTTTAC GGTTCCTGGC 1860 

CmrGCTGO CCTTTTGCTC ACATGTrCTT TCCTOCGFTTA TCXTCCTGATT CIGrGOATAA 1920 

CCXnATTACC GCCTTTOAGT GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG 1980 

CGAGTCAGTQ AOCQAGGAAG CX3QAAGA006 CCTGATGGGG TATTTTCTCC TTAOGCATCT 2040 

OTCCOOTATT TCACACOGCA TATGGTOCAC TCTCWJIACA ATCT0CTCT6 ATOCCOCATA 2100 

GTTAAGCXrAG TATACACTCC GCTATC6CTA C6TGACTGGG TCATGGCTGC GCCCCGACAC 2160 

CCGCCAACAC CCGCTGACXX: GCCCTSACGG GCTTOTCTGC TCCCGGCATC CGCTTACAGA 2220 

CAAGCTGTGA CCGTCTCCGO GAGCTGCATG TGTCAOAGGT TTTCACCQTC ATCACCGAAA 2280 

20 CGCGCGAGGC AGCATGCATC TCAATTAGTC AGCAACCATA GTCCCGCCCC TAACTCCGCC 2340 

CATCCCSCCC CTAACTCCOC CCAEnTCCGC CCATTCTCCO CCCCATGOCT GACTAATTTT 2400 

TTTTATTTAT GCAGAGGCCG AGGCOGCCTC GGCCTCTGAS CTATTCCA6A AGTAGTGAGG 2460 

AGGCrmrX GGAGGCCTAO GCTTTTGCAA AAAGCTAGCT TAGAGCTCAG GGCTGCGATT 2520 

TCGCGCCAAA CTTQACGGCA ATCCTAOOGT OAAQQCrGGT AGGATZTTAT CCXXX^TTGCC 2580 

ATCATGGTTC GACCATTGAA CTGCATCGTC GCCGTGTCCC AAAATATGGG GAirGGCAAG 2640 

AACGGAGACC TACCCTGGCC TCCGCTCAGG AACGAGTTCA AGTACTTCCA AAGAATOACC 2700 

ACAACCTCTT eAGTGGAAGG TAAACAGAAT CTGGTGATTA TGGGTAGGAA AACCTGGTTC 2760 

TCCATTCCTG AGAAGAATCG ACCTTTAAAG GACAGAATTA ATATAGTTCT CAGTAGAGAA 2820 

CTCAAAGAAC CACCAC6AG0 AGCTCATTTT CTTGCCAAAA GTTTGQATGA T6CCTTAAGA 2880 

CTTATTOAAC AACCXSGAATT GGCAAGTAAA GTAGACATGG TZTGGATAGT CGGAGGCAGT 2940 

TCTGTTIACC AflOAASCCar GRATCAACCA OGCCACCTCA GACTCTTTCT GACAAGGATC 3000 

ATGCAGGAAT TTQAAAGTQA CACX5TT1TTC CCAGAAATTG ATTT6GGGAA ATATAAACTT 3060 

CTCCCAGAAT ACCCASGCGT CCTCTCTGA6 GTCCAGGAGG AAAAAGGCAT CAAGTATAAG 3120 

^ TTTGAAGTCT ACOAGAflGAA ASACTAACAG GAAGATGCTT TCAAOTTCTC TQCTCCCCTC 3180 

CTAAAGCTAT GCAriTlTAT AAGACCATGG GACTTTTGCT GGCTTTAGAT CTTTGTCAAG 3240 

GAACCTTACT TCTCTGGTGT GACATAATTG GACAAACTAC CTACAGAGAT TTAAAGCTCT 3300 

^ AAGGTAAATA TAAAATTTTT AAGTGTATAA TGTGTTAAAC TACTGATTCT AATTGTTTGT 3360 

GTATrTTAOA TTCCAACCTA TSGAACTGAT GAATGGGAGC AGT6GTGGAA TOCCTTrAAT 3420 

GAGGAAAACC TOnTTGCTC iWSAAGAAATG CCATCTA3TG ATQATGASGC TACTGCTGAC 3480 

TCTCAACATT CTACTCCTCC AAAAAAGAAG AGAAAGGTAG AAGACCCCAA GGACTTTCCT 3540 

TCAGAATTGC TAAGTTTTTT GAGTCATCCT GTGTTTAGTA ATAGAACTCT TGCTTGCTTT 3600 

GCTATTTACA CCACAAAGGA AAAAGCTGCA CTGCTATACA AGAAAATTAT GGAAAAATAT 3660 
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va 1 itw A ITvj 


CTGCCGwGwu 


uUCCACCAGA CATAaTAGwT 


laACAGACTTiA 


5620 




^Rrt A/^'IV ' 


L. A A A wwATQG 


GTCTTTTCPG 


CAGTCACCGT CCTT6ACACG 


CGTCTCGGGA 


5880 


5 




Pft PPft1Y3/2/2 ft 




TCnTCTCTT TCrCCTGTCA 


CKjAACTGCAG 


5940 




U XUr A AW L\» 


TY^ ft f2^2TW ft/* 
A \>/Uj\3 1 WWW9 


Cil^CAAwAksl 


CTGGACCTGA OCZTSGTGAAG 


CCTGGGGCTT 


6000 






U A WW AUwMnU 




AuACATTCaC TGAaTACAwC 


ATACACTQG6 


€060 


10 




WWnA\MJMiwiU 


AGCCTTGAGT 


wATIXjGAGO TATTAaTCCT 


AaCAaTGGTA 


6120 








TTCAAuOvSCA 


GGGCCACATT GACT6TAG0C 


AAGTCCTCCA 


6160 






LiA A\9UHu1v X W 


(AJWUiQZwTGA 


CATCTGAGGA 1TCT6CG6TC 


TATTTCTGTG 


6240 


IS 


A/llk &mk US 


AA i wuU wTAT 


OGTTACGACG 


AGQGCCATGC TATGGACTAC 


TGGGGTCAA6 


6300 


GAACCrrCAGT 


uA^< WU 1 WAWW 


TCAGvTQACfr 


GGATCCTCTG CGCCIGGGCC 


CAGCTCXlvlXJ 


6360 






uTuAwATuuC 


ACCACCTCTC 


TTGCAGCCTC CACCAAGGGC 


CCATCGGTCr 


6420 






ACwwlwwTwC 


AAGAGCACCT 


CTGGGGGCAC AGCGGCCCTG 


GGCTGCCTGG 


6480 






Li iwwwwuAA 


COGGTGACCKS 


TGTCGTGGAA CTCAGGC6CC 


CTGACCA6CG 


6540 






{^mCCCGGCT 


ulVl^ACAST 


CC7CAGGACT CTACTCCCTC 


AQCA6CGT6G 


6600 




A >avwLa WW A\3V> W 






AQJ^LTiACAT wTGCAACCtaXj 


AATCACAAQC 


6660 


25 




r*ft ftfs^TTVsnfti^ 


AAGAAAnj"*!^!^ 


AOCwCAAATw TTOTQACAAA 


ACTCACACAT 


6720 




VWVWM\*WW A V 


nrT'ftrar* ft nri* 

W WWJlfcJWJlW W A 




UVjUACCwaw ASX^TTTwCXw 


TTwCCwCCAA 


6780 






r«ftryf iif^wtfY* 
W>Al*w w 1 wAl w 


ATCTCCCGGA 


w wwL, l^aAuCvT CACATGwGTG 


UTGGTGGAC6 


6840 






ft«ar»fVTOftrt 


tf"A'C!Afttf'A"i'CA 


K^wvii/ i*i»ft ^^*f* nm^^nn/'MS'nn 
AfTTuOTACCST wjACQQCGTO 


Ik IkfVlIk 

GAG<?T\3CATA 


6900 


30 


A A V9WW#wUXHW 


ftftftfsrmrw 

aAtUjwWVjwvju 


^ft/2/2ft Jl/"*^ 


AuAACAGCAw GTAwwGGGTG 


^fW^<k J w< 

GTCAGCGTCC 


6960 






u W^W V.JHuniXfVL> 


iVTViW A \aAAl\jr 


(iCAAGGAGTA CAAGTGCAAG 


GTwTCCAACA 


7020 




#viU www A W WU> 


#vJWwWW^J%A W 


nftnaftftftpfft 


lUTwCAAAGC CAAAGuCtCAG 


CCCCGAGAAC 


7080 


3S 


f ftf ft^i i'l'li^k'n 


WMWWW A V>W WW 


f*pft*!W*r*iv2r' 

WWXh A WW WUVJVI 


iUjKJAljiATVMiw wAAGAACCAS 


GTCAGCCTGA 


7140 




WW A VW W A\9U A 


WMiwIUVJWA AW 


Aa aw WWnUVAl 


AWAAAAiwwCrT UwUsTA^jGiSAm 


AGCAATGGGC 


7200 




MuwwwUffwstm 


WfWW AMWnnU 


#tWW#^UWW AW 


wwol\jwX\9GA CXV^JUACajGC 




7260 


40 




'*ftf f^flf^T! 
WW A WMWWU Au 


fiAC^IUknJlRPft 
\mwncwwM3WA 


flf?l^(snf*nr!f*ft Rnrvsftftrv^ov^ 
WJ A\j\jwitHJV.i A uUlauAAw\jTW 


TTCTCATGCT 


7320 


pfifymftTY5f*ft 

W W w * * 


Aunww A W Aw 


r&rftftrr*ani» 
wmwmmwwmw a 


AWnwvaWiUwiA laAuWdwAwL 


CT6TCTCCG6 


7380 




GTAAATGAGT 


GGGACGGCCG 


GCAAGCCCCG 


CTCCCOGGGC TCTCGCOOTC 


GCACGAGGAT 


7440 




GCTTGGCA06 


TACCCCCTGT 


ACATACTTCC 


CGGGCQCCCA GCATGGAAAT 


AAAGCACCX36 


7500 


45 


ATCTAATAM 


AGATATITAT 


TTTCATTAGA 


TATCTCIXSW GGTTTTTTGT 


GTGCAGT6CC 


7560 




TCTATCTGGA 


GGCCAGGTAG 


GGCTGGCCTT 


GGGGGAGGGG GAGGCCAGAA 


TGACTCCAAG 


7620 




iVGCTACAGGA 


AGGCAGGTCA 


GAGACCCCAC 


TGGACAAACA GTGGCTGGAC 


TCTGCACCAT 


7680 


50 


AACACACAAT 


CAACAGGGGA 


GTGAGCTGGA 


AATTTGCTAG CGAATTAATT 


C 


7731 



(2) INFORMATION FOR SBQ ID KO: 30: 
(i) SBQUENCE CHARACTBRISTICS: 
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(A) LBNOTH: 472 amino acids 
<B) TYPE: amino acid 

(C) SmANDBDMBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCDU lYPB: peptide 



(xi) SBQOBRCB DBSCRIPTZON: SBQ ID NO: 30: 



Met Gly Trp Ser Trp Val Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
15 10 15 

Val Leu Ser Glu Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys 
20 25 30 

IS Pro Gly Ala Ser Val Lys Met Ser Cys Lys Thr Ser Arg Tyr Thr Phe 

35 40 45 

Thr Glu Tyr Thr He His Trp Val Arg Gin Ser His Gly Lys Ser X^u 
50 55 60 

Glu Trp lie Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Aen 

20 65 70 75 80 

Gin Lys Phe Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ser Ser 
85 90 95 



Thr Ala Tyr Mat Glu Leu Arg Ser Leu Thr Ser Glu Asp Ser Ala Val 
100 105 110 

Tyr Phe Cye Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His 
115 120 125 

Ala Met Asp Tyr Trp Qly Gin Gly Thr Ser Val Thr Val Ser Ser Ser 
130 135 140 

Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr 
145 ISO 155 160 

Ser Gly Gly Thr Ala Ala Leu Gly Cye Leu Val Lys Asp Tyr Phe Pro 
165 170 175 

Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 

^ 180 185 190 

His Thr Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser 
195 200 205 

Ser Val Val Ihr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 
210 215 220 



Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val 
225 230 235 240 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
245 250 255 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
260 265 270 

Lys Asp Thr Leu Met He Ser Arg Thr Pro Glu val Thr Cys val val 
27S 280 285 

Val Asp Val Ser His Glu Asp Pro Glu Val Lye Phe Asn Trp Tyr Val 
290 295 300 

Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
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305 310 315 320 

Tyr Aan Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu Hie Gin 
325 330 335 

Asp Trp Leu Asn Gly Lye Glu Tyr Lys Cys Lye Val Ser Asn Lye Ala 
340 345 350 

Leu Pro Ala Pro He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro 
355 360 365 

Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Met Thr 
370 375 380 

Lys Asn Gin Val Ser Leu Hir Cye Leu Val Lys Gly Phe Tyr Pro Ser 

385 390 395 400 

IS Asp He Ala Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 

405 410 415 

Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
420 425 430 

Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
^ 435 440 445 

Ser Cys Ser Val Met His Glu Ala Leu His Aan His Tyr Thr Gin Lys 
450 455 460 



25 



Ser Leu Ser Leu Ser Pro Gly Lys 
465 470 

(2) INFORMATION FOR SBQ ID NO: 31: 

(i) SBQOBN CB CH ARACT5RISTICS ; 

(A) LENGTH: 339 base pairs 

(B) TYPB: nucleic acid 
30 (C) STRAMDBDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: cDMA 



3S 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 31: 

GACATTGTGA TGACCCAATC TCCAGACTCT TTGGCTGTGT CTCTAGGGGA OAGGGCCACC 60 

ATCAACTGCA AGTCCAGTCA GAGCCmTA TATICTAGAA ATCAAAASAA CTACTTGGCC 120 

^ TGGTATCAGC AGAAACCAGG ACAGCCACCC AAACTCCTCA TCTTTTGGGC TAGCACTAGG 180 

GAATCTGGGG TACCTQATAG GTTCAGTGGC AGTGGGTTTG GGACAGACTT CACCCTCACC 240 

ATTAGCAGCC TGCAOGCTGA AGATGTGGCA GTTTATTACT GTCAGCAAXA TTTTAGCTAT 300 

45 CCGCTCACGT TCGGACAAGG GACCAAGGTG GAAATAAAA 339 
(2) INFORMATION FOR SBQ ID NO: 32: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPB: amino acid 

SO iC) STRAIIDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: peptide 
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(xi) SBQUBNCB DESCRIPTION: SBQ ID NO: 32: 

Aso He Val Mtet Thr Oln Ser Pro Asp Sor Leu Ala Val Ser Leu Oly 
15 10 15 

Glu Arg Ala Thr He Asn Cys Lya Ser Ser Gin Ser Leu Leu Tyr Ser 

20 25 30 

Arg Aan Gin Lya Aan Tyr Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin 
35 40 45 

Pro Pro Lys Leu Leu He Phe Trp Ala Ser Thr Arg Glu Ser Gly Val 
SO 55 60 

Pro Asp Arg Phe Ser Oly Ser Gly Phe Oly Thr Asp Phe Thr Leu Thr 

65 70 75 80 

He Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Tyr Cya Gin Gin 
85 90 95 

Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Thr Lys Val Glu He 
100 105 110 

Lye 



(2) INFORMATION FOR SBQ ID NO: 33: 

(i) SBQUBNCB CHARACTBRISTICS : 

(A) LBNGTH: 113 amino acids 

(B) TVPB: amino acid 

(C) STRAMDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: peptide 



(xi) SBQfUBHCB DBSCRIPTION: SBQ ID NO: 33: 

ASD He Val Met Thr Oln Ser Pro Asp Ser Leu Ala Val Ser Leu Gly 
15 10 15 

Glu Arg Ala Thr He Asn Cys Lys Ser Ser Gin Ser Leu Leu Tyr Ser 
20 25 30 

Arg Asn Gin Lys Asn Tyr Leu Ala Trp Phe Gin Gin Lys Pro Oly Gin 
35 40 45 

Pro Pro Lys Leu Leu He Phe Trp Ala Ser Thr Arg Glu Ser Gly Val 
50 55 60 

Pro Asp Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp Phe Thr Leu Thr 
65 70 75 80 

He Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Asp Cys Gin Gin 
85 90 95 

Tyr Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Ttir Lys Val Glu He 
100 105 110 

Lys 



(2) INFORMATION FOR SBQ ID NO: 34: 
(i) SBQUBNCB CHARACTBRISTICS: 
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(A) LBNGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDBONBSS: single 

(D) T0POLCX3Y: linear 

(ii) M9LBCULB TYPB: peptide 



(xi) SBQDBNCR DESCRIPTION: SEQ ID NO: 34: 

Asp lie Val HBt Itir Gin Ser Pro Asp Eer Leu Ala Val Ser Leu Gly 

15 10 15 

Glu Arg Ala Thr lie Asn Cys Lys Ser Ser Oln Ser Leu Leu Tyr Ser 
20 25 30 

Arg Asn Gin Lys Asn Tyr Leu Ala Txp Tyr Oln Gin Lys Pro Oly Gin 
35 40 45 

Pro Pro Lys Leu Leu lie lyr Trp Ala Ser Thr Arg Glu Ser Gly Val 
50 55 60 

Pro Asp Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp Phe Thr Leu Thr 
65 70 75 80 

He Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr Tyr Cys Gin Gin 
&5 90 95 

Tyr Phe Ser Tyr Pro Leu Tlir Phe Gly Gin Gly Thr Lys Val Glu He 
100 105 110 

Lys 



(2) INFORMATION FOR SEQ ID NO: 35: 

30 (i) SBQDBNCE CHARACTERISTICS: 

(A) LENGTH: 8068 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOLOGY: linear 



(ii) MOLBCOLB TYPB: DRA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GAATTCCAGC ACACTGGCGO CCGTTACTAG TTATTAATaG TAATCAATTA CQGGGTCATT 60 

AGTTCATAGC CCATATATGG AGTTCCXKXTT TACATAACTT ACGGTAAATG GCCCGCCTG6 120 

CTGACCGCCC AACGACCCCC GCCCATTGAC GTCAATAATG ACGTATGTTC CCATAGTAAC 180 

GCCAATAGGG ACTTTCCATT GACGTCAATG GGTGGAGTAT TTACGGTAAA CTGCCCACTT 240 

45 GGCAGTACAT CAA(3TGTATC ATATGCCAAG TACGCCCCCT ATTGACGTCA ATGACGGTAA 300 

ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACITTCCTA CTPGGCAGTA 360 

CATCTACGTA TTAGTCATCQ CH^ATTACCAT GGT6AT6CGG TTTTOGCAGT ACATCAATGG 420 

GCGTGGATAG CGOrTTGACT CAOKXXSATr TCCAAGTCTC CACCCCATTQ ACGTCAATGO 480 

GAGTTTGTTT TGGCACCAAA ATCAACGGGA CTTTCCAAAA TGTOGTAACA ACTCCGCCCC 540 

ATTGACGCAA ATGGGCGGTA GGC6TGTACG GTCSGGAGGTC TATATAAGCA GAGCTCGTTr 600 
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vx 1 1 1 i 1 1 \ j i\i 


TGCAGTGCCt 


CTATCTGtaAO 


QCCAuGTAQCi QCITXiCCTTS 


uGGGAGGGGG 


2760 






ivi'wi'/viv 1V/51V 
ijAL i lAi-AAuA 


(SCTACiujiVsAA 


/^^^jt ^«/»nv iv^iv^*A*^^iv/*p 
GGCAGGTCAG AGACCCCACT 


GGACAAACAG 


2820 


5 






IV ^ IV ^ flV A 1V*T*^ 

ACACACAATC 


AACAGGGGAG TGAGCTOGAA 


ATITGCTAGC 


2880 




(sAAX iXTi rUA 


IV^ IV^^ A SV A/3n 

WaACfaAAAbO 


GCCTCGTGAT 


ACGCCTA'A'AX TrJ^J^GGTTA 




2940 




AATAAi\sU^i A 


TCiTAGAOGT 


CA6GTGGCAC 


TTTTCGGGGA AATGTGCGCG 


GAACCCCTAT 


3000 


10 


TrGTTTATlT 


TTCTTAAATAC 


ATTCAAATAT 


GTATCGQCrC ATSAGACAAT 


AACCCTGATA 


3060 




AATBCTrrCAA 


TAATAITGAA 


AAAGGAAC3A0 


TATQA8TA1T CAACATTTCC 


GTGTCGCCCT 


3120 




TATTCCci'rr 


^TTGOGGCAT 


TTTGCCTTCC 


TGTTTTTGCT CACCCAGAAA 


CGCTGGTGAA 


31B0 




ACjTAAAAGAT 


6CTGAAGATC 


A8TFGG6T3C 


ACGmTTGGGT tacatcgaac 


TGGATCTCAA 


3240 


15 


CAGCGGTAAG 


ATCCTTGAQA 


GTTrTCGCCC 


CGAAGAACGT TTTCCAATGA 


TGAGCACTTT 


3300 




TAAAffTTCTG 


vrAl\3l\i%}CG 


CG^*ATTATC 


Cu\iA\f A~i\«AC GCCGGGCAAG 


ik^/4« fk/**rTVV^^ 

AGCAACTCGG 


3360 




TCajCCGCATA 


CACTATTCTC 


AGAATGACTT 


GGTTGAGTAC TCACCAGTCA 


Ik Ik Vk Ih 

CAGAAAAGCA 


3420 


20 


TC'i'i'ACGGAT 


uuCAlXiACAu 


n*iv IV niv ^ IV iv*ffvf* 


atgcagtgct gccataacca 


TGMjTGATAA 


3480 






AACTTACTTC 


iiv*iv/^iv K^/vflvm 

T6ACAACGAT 


CGGAGGACOG AAwGAGCTAA 


CCGCTTTTTT 


3540 




IjCACAACATB 


GCjOQATCATG 


TAAlCTCGCCT 


TGATCisA~A\«6 GAACCGGAGC 


TGAATGAAGC 


3600 


25 


CATACCAAAC 


GACGAGCuIu 


ACACCACGAT 


GCCTBCAGCA ATGGCAACAA 


CGTTGCGCAA 


3660 


ACTATTAACr 


QQOijAAlCTAC 


TTACTCTAOC 


TTCGCGGCAA CAA~AATAG 


ACIQGATGGA 


3720 




^jUvviUATAAA 


< •*I*tV *f**A/?/37\y^ 


CACTTCTCjCG 


CTCGGCCCTT CCGGCTGGCT 


GGTTTATTGC 


3780 




TxiATAAATLTT 


GGAGCCGGTG 


AuCGTGGGTC 


tcgcggtatc attgcagcac 


TGGGGCCAGA 


3840 


30 


TuGTAAGCCC 


TCCCGTATCG 


TAGTTATCTA 


CACGACGGGG AGTCAGGCAA 


CTATGGATGA 


3900 




li K IV ivn* fv 


/^IV/^1V»TW^/ WT^ * 

LJuiAlvJvKrivy 


AGATAGCSTGC 


CTCACTGATT AAGCATTGGT 


AACTGTCAGA 


3960 




CCAAG17TAC 


1V*P IV*I* IV /** 


TTTAGUVrrGA 


TTTAAAACTT CA7ITI7EAAT 


n«nm ik fk ik ik m 

TTAAAAGGAT 


4020 


35 




AlV^ A A I'l'l^ 


ATAAaCXCAT 


AIV^^IV IV IV IV*l*^ i'M'W|W|llVlV^/»Wl 

uACCAAAATC CCTTAACvTS 


AG^^TCG^T 


4080 




A wJmML L. viAJ 


n*iv R A Annn* 


^IvivivAimviiVwii «iu ■■■iwiflkAivnio 
CAAAGuATCT TLTA A\iAGAT^ 


CA'A'A'A'A'l'lVl' 


4140 






A\J\,. A\aV A i\3w 




AI,.WA!CCuwaA CCAA3\A9uT\9G 


TTTGT'TOGCC 


4200 






U A Jlik* wwllU Aw 


A A A 1 A\A^uAJi 


nrvpivA/vnrui/* T*iv^ivr'^iv/^iv^ 


wGCAuATAluC 


42oU 


40 




A VAJMi^AUA 


<MvV>VV>iMuA A 




CTGTAGCACC 








w A WwV A v.. A V9\» 


A>inAVwAVjA A 


iiC A uVj w X Vtx* A\9L. ViJUj a Vj 


(jL.uA a AAui a v. 










>-.>»rv .I ^^Vai^ A i\. 




OV* lAvV^jiO^ A \J 




45 


AACGGGGGGT 


TCGTGCACAC 


AGCCCAGCTT 


GGAGCGAACG ACCTACACCG 


AACTGAGATA 


4500 




CCTACAGCGT 


GAGCTATOAG 


AAAGCGCCAC 


GCTTCCCGAA GGGAGAAAG6 


CGGACAGGTA 


4560 




TCCGGTAAGC 


GGCAGGGTCG 


GAACAGGAGA 


6CGCACGAGG GAGCTTCCAG 


GGGGAAACGC 


4620 


SO 


CPGGTATCTT 


TATAfiTCCTG 


TCGGGTTTCG 


CCACCTCTGA CTTGAGCGTC 


GATTTTTGTG 


4680 


, ATGCTCGTCA 


GCSGGGGCGGA 


GCCTATGGAA 


AAACX3CCAGC AACGCGGCCT 


TTTTAOGGTT 


4740 




CCTGGCCTTT 


TGCTGGCCTT 


TTGCTCACAT 


GTTCTTTCCT GCGTTATCOC 


CTGATrCTGT 


4800 
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GGATAACCGT ATTACCGCCT TTGACTGAGC TGATACCGCT CGCCGCAGCC GAACGACCGA 4860 

GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG ATGCGGTATT TTCTCCTTAC 4920 

GCATCTCTGC GGTATTTCAC ACCGCATATG GTGCACTCTC AGTACAATCT GCTCTC3ATGC 4$ 80 

CGCATACrrA AGCCAGTATA CACTCCGCTA TCGCTACGT6 ACTGGSGTCAT GGCIG06CCC 5040 

CGACACCCGC CAACACCCGC TGACGCGCCC TGACGGGCTT GTCTGCTCCC GGCATCCGCT 5100 

TACAGACAAG CIGTQACCQT CTCXXK^GAGC TGCATGTGTC AGA6GTTTTC ACGGTCATCA 5160 

CCQAAAC6CG CGAGQCAGCT GTGGAAIGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC 5220 

TCCCCAGCAG GCAGAAGTAT GCAAAOCftlG CATCTCAATT A{?TCAGCAAC CAOGCTCCCC 5280 

AGCAGGCAGA AGTATGGAAA GCAtGCATCT CAATTAGTCA 6CAACCATA6 TCCCGCCCCT 5340 

AACTCCGCCC ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC CCCATGGCTG 5400 

ACTAATTTXT TTTATITATG CAGAGGCTGA GGCCGCCTCG GCCTCTGAGC TATICCAGAA 5460 

GTAGTCAGGA GGCTTTTTTG GAGGCCTAGO CTTTTGCAAA AAGCTAGCTT CACGCTGCCG 5520 

CAAGCACTCA GGGCX3CAAGG GCTGCTAAAG GAAGCGGAAC ACGTAGAAAG CCAGTCOGCA 5580 

GAAACGGIYSC TGACCCGG6A TCAATCStCACS CTACTGGGCT ATCTGGACAA GC3GAAAA06C 5640 

AAGCGCAAAG AGAAAGCAGG TAGCTTGCAG TGGGCTTACA TGGCGATAGC TAGACTGGGC 5700 

GGTTITATGG ACAGCAAOCG AACCGGAATT GCCAGCTGOG GCGCCCTCTG GTAAGGTTGG 5760 

GAAGCCCTQC AAAGTAAACT GOATGOCTTT CTTGCOGCCA AGGATCTGAT GOCQCAGGGO 5820 

ATCAAGATCT GATCAAGAGA CAGGATGAGG ATC6TTTCGC ATGATTGAAC AAGATGGATT 5880 

GCACGCAGOT TCTCCGGCCG CTTGGGTGGA GACGCTATTC GGCTATGACT GGGCACAACA 5940 

GACAATCGGC TGCTCTGATG CC5GCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT 6000 

TTTTGTCAAG ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 6060 

ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG TCACTGAAGC 6120 

GGGAAGGGAC TGGCPGCTAT TGGGCGAA6T GCOGGGGCAG GATCTCCTGT CATCTCACCT 6180 

TGCTCCTGCC GAGAAAGTAT CCATCATGGC TGATGCAATG CGGCGGCTGC ATACXXTTTGA 6240 

TCOGGCTACC TGCCCMTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG 6300 

GATXK3AAGCC GGTCTTOTCO ATCAGGATGA TCTOGACXSAA GAGCATCAGG GOCTOGOGCC 6360 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC TCGTOGTGAC 6420 

CCATGGCGAT GCCTGCTTGC CQAATATCAT GGTGGAAAAT GGCCGCTTTT C?TGGATrCAT 6480 

CGACTGTQGC CX3GCTGGGTG TGQCGGACCG CTATCAGGAC ATAGCGTTGG CTACCCGTGA 6540 

TATTGCTGAA GAGCTTGGCG GCGAATGGGC TGACCGCTTC CTCGrGCTTT ACGGTATCGC 6600 

CGCTCCCGAT TCGCAGCGCA TOjCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGAGCGGG 6660 

ACTCTGGGGT TCGAAATGAC CGACCAAGCX5 ACXKICCAACC TGCCATCACG AGATTTCGAT 6720 

TCCACCGCXTG CCTTCTATGA AAGGTTGGGC TTCGGAATCG TTTTCCGGGA CX3CCGGCTGG 6780 

ATGATCCTCC AGCGCGGGGA TCTCATGCTG GAGTTCTTCG CCCACCCCGG GCTCGATCCC 6840 

CTCGCGAGTT GOTTCAQCTO CTGCCTOAGG CTGGACOACC TCGCGGAGPP CTACCGQCA6 6900 
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TGCAAATCCG 


TCGGCATCCA 


GGAAACCA6C 


AGCGGCTATC CGCGCATCCA TGCCCCC3GAA 


€960 


CTGCAGGAGT 


GGGGAGGCAC 


GATGGCCGCT 


TTGGTCCCGG ATCTTTGTGA AGGAACCTTA 


7020 


CTTCTGTGGT 


GTOACATAAT 


TGGACAAACT 


ACCTACAGAG ATITAAAGCT CTAAGGTAAA 


7060 


TATAAAATTT 


TTAAGTGTAT 


AATOTGTTAA 


ACTACT6ATT OTAATTSTTT 


GTGTA3TTTA 


7140 


GATTCCAACC 


TATGGAACTO 


ATGAATGGGA 


GCAGTGGTGG AATGCCTTTA ATGAGGAAAA 


7200 




TCAGAAGAAA 


IGCCATCTAG 


TGATGAtXSAG 6CTACTGCTG ACTCTCAACA 


7260 


TTCTACTCCT 


CCAAAAAAGA 


AGAGAAAGGT 


AGAAGACCCC AASGJiCTTTC 


CTTCAGAATT 


7320 


GCTAA6TTTT 


TIGACTCATG 


CTGItjTTTAG 


TAATAGAACT CTTQCTTGCT 


TTGCTATTTA 


7380 


CACCACAAAG 


GAAAAAGCTG 


CACTQCTATA 




7440 


CTTTATAAGT 


AGGCATAACA 


GTTATAATCA 


TAACATACTG Vi'mTCVVA 


CTCCACACAG 


7500 


GCATAGAGTG 


TCTGCTATTA 


ATAACTATGC 


TCAAAAATTG TGTACCTTrA 


GCTTTTTAAT 


7560 


TTGTAAAGGG 


GTTAATAAGG 


AATATTTGAT 


GTATAGTGCC TTOACTAGAG 


ATCATAATCA 


7620 


GCCATACCAC 


ATTTGTAGAG 


GTTTTACTTG 


CTTTAAAAAA CCTCCCACAC 


CrCCCCCTGA 


7680 


ACCTGJUACA 


TAAAATQAAT 


GCAATlt/riTJ 


TTGTTAACTT GTTTATTGCA 


GCXTATAATS 


7740 


GTTACAAATA 


AA6CAAT!AGC 


ATCACAAATT 


TCACAAATAA AGCATTTTTT 


TCACTGCATT 


7800 


CTAGTTGTGG 


TTTGTCCAAA 


CTCATCAATG 


TATCTTATCA TGTCTGGATC 


TAATAAAAGA 


7860 


TA'nrrATTTT 


CATTAGAXAT 


GTGTGTTGGT 


'rrriTGTGTG CAGTGCCTCT 


ATCTGGAGGC 


7920 


CAGGTAGGGC 


TGGCCITGGG 


GGAGGGGGA6 


GCCAGAATGA CTCCAAGAGC 


TACAGGAAGG 


7980 


CAGGTCAGAG 


ACCCCACTGG 


ACAAACAGTG 


GCTGGACTCT GCACCATAAC 


ACACAATCAA 


8040 


CAGGGGAGTG 


AGCTGGAAAT 


TTGCTAGC 






6068 



(2) INFORMATION FOR SBQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDRBSS : single 
<D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPE: protein 



(xi) SBQGENCB DBSCRIPTIOT: SEQ ID MO: 36: 

Het: Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
15 10 15 

Gly Ser Ser Gly Asp lie Val Met Thr Gin Ser Pro Asp Ser Leu Ala 
20 2S 30 

Val Ser Leu Gly Glu Arg Ala Thr lie Asn Cys Lys Ser Ser Gin Ser 
35 40 45 

Leu Leu Tyr Ser Arg Asn Gin Lys Asn Tyr Leu Ala Trp Tyr Gin Gin 

50 55 60 

Lys Pro Gly Gin Pro Pro Lys Leu Leu lie Phe Trp Ala Ser Thr Arg 
65 70 75 80 
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Glu Ser Oly Val Pro Asp Arg Phe Ser Gly Ser Gly Phe Gly Thr Asp 
B5 90 95 

Phe Thr Leu Thr He Ser Ser Leu Gin Ala Glu Asp Val Ala Val Tyr 
100 105 110 

Tyr Cys Gin Gin TVr Phe Ser Tyr Pro Leu Thr Phe Gly Gin Gly Thr 
lis 120 125 

Lys Val Glu He Lys Arg Val Phe He Phe Pro Pro Ser Asp Glu Gin 
130 135 140 

Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr 
145 150 155 160 

Pro Arg Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser 
165 170 175 

Gly Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
180 185 190 

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys 
195 200 205 

20 His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser Ser Pro 

210 215 220 

Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SBQDBNCB CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDBDNESS: double 

(D) TOPOLOGY: linear 

(ii) HOLBCUU TYPE: CDNA 



(jci) SB^^IBNCB DESCRIPTION: SEQ ID NO: 37: 

CA6GTGCAAC TAGTGCAGTC CGGCGCCGAA GT6AAGAAAC CCGGTGCTTC CGTGAAAGTC 60 

AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA TACACTGGOT TAGACAGGCC 120 

CCTGGCCAAA GGCTGGAGTG GATAGGAGGT ATTAATCCTA ACAATGQTAT TCCTAACTAC 180 

40 AACCAGAAGT TCAAGGGCCG GGCCACCTTQ ACCGTAGGCA AGTCTGCCAG CACCGCCTAC 240 

ATGGAACTGT CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTACTGCGC CAGAAGAAGA 300 

ATCGCCTATG GTTAOGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG AACCCTTGTC 360 

ACCOTCTCCT CA 372 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQOBNCB CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 
<C) STRANDBDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPE: peptide 
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<xi) SEQOBNCB DESCRIPTION: SEQ ID NO: 38: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Tlir Glu Tyr 

20 25 30 

Thr lie His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ala Ser Ttir Ala Tyr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His Ala Biet Asp 
100 105 110 

Tyr Trp Gly Gin Gly Thr Leu Val Tfer Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SBQOENCS CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNE5S : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DBSCRIPTIOH: SEQ ID NO: 39: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lye Pro Gly Ala 
X 5 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr 

20 25 30 

Thr He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Aen Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

Mtet Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Phe Cys 
85 90 95 

Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His Ala Mtet Asp 
100 105 110 

Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SBQ in NO: 40: 
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(i) SBQOBNCB CKARACTBRISTICS : 

(A) LEKGTH: 124 amino acido 

(B) TYPB: amino acid 

(C) STRANDBDNBSS : single 
<D) TOPOUXSY: linear 

(ii) MOLBCOLB TYPE: peptide 



10 (xi) SBQDBKCB DESCRIPTION: SEQ ID NO: 40: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lye Pro Gly Ala 
15 10 15 



IS 



Ser Val Lys Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe Thr Glu Tyr 
20 25 30 

■mr He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Aen Asn Gly He Pro Asn Tyr Asn Gin Lys Phe 
50 55 60 

20 Lys Gly Arg Val Ttir He Thr val Asp Thr Ser Ala Ser Thr Ala Tyr 

65 70 75 80 

Ifet Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Arg Arg Tie Ala Tyr Gly Tyr Asp Glu Gly His Ala Mat Asp 
25 100 105 110 

Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID MO: 41: 

^ (i) SBQDENCB CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRAHDBDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLBCOLS TYPE: peptide 



(xi) SBQOBNCB DESCRIPTION: SEQ ID NO: 41: 

Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 
15 10 15 

Ser Val Lys Val Ser Cys Lys Thr Ser Gly Tyr Thr Phe Thr Glu Tyr 
20 25 30 

Thr He His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu Glu Trp He 
35 40 45 

Gly Gly He Asn Pro Asn Asn Gly He Pro Aen Tyr Asn Gin Lys Phe 
50 55 60 

Lys Gly Arg Val Thr He Thr Val Asp Thr Ser Ala Ser Thr Ala Tyr 
65 70 75 80 

Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala val Tyr Tyr Cys 
85 90 95 
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Ala Arg Arg Arg lie Ala Tyr Gly Tyr Asp Glu Gly His Ala Met Asp 
100 105 110 

Tyr Trp Oly Gin Oly Tlur Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID ElO: 42: 

(i) SBQDBKCB CHARACTBRISTICS : 

(A) LBNGTH: 7731 base pairs 

(B) TyPK: nucleic acid 

(C) STRAMDBDNBSS : double 
(O) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: DNA (genomic) 



(xi) SBQUBNCE DBSCRIPTION: SEQ ID NO: 43: 
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TT6AAGACGA AAGGGCCTOG 


TGATAOGCXrr ATmTATAO 


GTTAAXOTCA 


TGATAATAAT 


60 


GGTTTCrTAG AC3t5TCAGGTG 


GCACTTTTCG GGGAAATGTG 


CGCGGAACCC 


CTAlTlVriT 


120 


ATTTTTCTAA ATACATTCAA 


ATATGTATCC OCTCATCSAGA 


CAATAACCCT 


GATAAATGCT 


180 


TCAATAATAT TGAAAAAGGA 


AGAGTATCAG TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


240 


CTTTTTTGC3G GCATTTTGCC 


TifXTlVrm TGCtCACCCA 


GAAACGCT6G 


TGAAAGTAAA 


300 


AOATGCIQAA GATCAOTTGG 


CTQCAOQAGT OGGTTACATC 


QAACIGGATC 


TCAACAOCGG 


360. 


TAAGATCCn GASAblTi'iV 


GCCCCXSAAGA ACGTTTrCCA 


ATGATGAGCA 


CTTTTAAAGT 


420 


TCTGCTATGT GG06G6GTAT 


TATCCC36TGT TGAC6CCG06 


CAAGAGCAAC 


TCGGTOGCOG 


480 


CATACACTAT TCTCAGAATG 


ACTTGGTTQA GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


540 


GGATGGCATG ACAGTAAGAG 


AATTATGCAG TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


600 


GGCCAACTTA CITCTGACAA 


CGATC66AGG ACCGAA6GAG 


CTAACCGCTT 


TTTTGCACAA 


660 


CATGGGGGAT CAT6TAACTC 


GCCTTGATCG TTGGGAACC6 


OAGCTGAATQ 


AAGCCATACC 


720 


AAACGAOGAO CGTGACACCA 


OGATGCXrXGC A6CAATGGCA 


ACAAOGTTGC 


GCAAACTATT 


780 


AACT6G06AA CTACTTACTC 


TAGCTTCCCG GCAACAATTA 


ATA6ACTGGA 


T6GAGGC6GA 


640 


TAAAGTTGCA GGACCACTTC 


TGCGCTCGGC CCTTCCGGCT 


GGCIGGTITA 


TIGCTGATAA 


900 


ATCTQGAGCC GGTQA6CGT8 


GGTCTCXKX3a TATCATTGCA 


GCACTGGGGC 


CAGATOGTAA 


960 


GCCCTCCCGT ATOGTAGTTA 


TCTACACGAC GGGGAGTCAG 


GGAACTATGG 


ATGAACGAAA 


1020 


TAGACAGATC GCTGAGATAG 


GTGCCTCACT GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


1080 


TTACTCATAT ATACTTTAGA 


TTGATTTAAA ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


1140 


GAAGATCCTT TITGATAATC 


TCATGACCAA AATCCCTTAA 


CGTGAGTTTT 


CX3TTCCACT0 


1200 


AGCGTCAGAC CCCGTAOAAA 


AGATCAAAGG ATCTTCTTQA 


GATCCrmT 


TTCTGCOOGT 


1260 


AATCTGCTGC TTGCAAACAA 


AAAAACCACC GCTACCAGCG 


GTGGTTTGTT 


T6CCGGATCA 


1320 


AGA6CTACCA ACTCTTTTTC 


CGAAGGTAAC TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


1380 


TGTCCTTCTA GTGTAGCCCT 


AGTTAGGCCA CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


1440 


ATACCTCGCT CTGCTAATCC 


TGTTACCAGT GGCTOCTGCC 


AGTG6CGATA 


AGTCGTGTCT 


1500 
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TACCGGGTTG GACTCAAGAC GATAGTrTACC GGATAAGGOG CAGCGGTCGG GCTGAACGGG 1560 

GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC ACCGAACTOA GATACCTACA 1620 

GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT 1680 

AAGCGGCAGG GTCGGAACAO GAOAGOGCAC GAGGGAGCTT CCAQGGGGAA ACX3CCTGGTA 1740 

TCPTTATAGT CCTOTCXlGaT TTOGCCACCT CTOACTTQAO COTOOATTTT TGTOATOCrC 1800 

GTCAGGGGGG CGGAGCCTAT OGAAAAAOGC CA6CAAC606 GCXnTXTAC GGTTCCTGGC 1860 

CnTTGCTGG CCTTTTOCTC ACATGTTCTr TCCTGCGTTA TCCCCTGATT CTGTGGJITAA 1920 

CCGTATTACC GCCTTTGAGT GAGCTGATAC GGCTCGCCGC AOCGQAACGA OCOAOOGCawa 1980 

CXSAOTCAGTG AG06AGGAA6 CGGAAGAGCG CCTGATGCX3G TATTTTCTCC TTACGCATCT 2040 

GTGCGGTATT TCACACCGCA TATGGTOCAC TCTCAGTACA ATCTGCTCTQ ATQCCGCATA 2100 

GTTAAGCCAG TATACACTCC GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC 2160 

CCGCCAACAC CCGCTGACSGC GCCCTQACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA 2220 

CAAGCTGIGA CCXSTCTCOGG GAGCTGCATG TCTCAGAGGT TTTCACCGTC ATCACCXUiAA 2280 

CGOGGGAGGC AGCATGCATC TCAATTAGTC AGCAACCATA GTCCCGCCCC TAACTCCGCC 2340 

CATCCCQGCC CTAACTCCXIC OCAGTTCCGC CCaTTCTCCa CCCCATGOCT OACTAATnT 2400 

TTTTATrTAT GCAGAGGCGG AGGCCGCCTC GGCCTCTGAG CTATTCCAGA AGTAGTGAGG 2460 

AQGCTTTTTT GGAGGCXTTAG GCTTTTGCAA AAAGCTAGCT TACAGCTCAG GGCTGCGATT 2520 

TCGCGCCAAA CTTOACGGCA ATCCTAGCGT GAAGGCTGCT AGGATTTTAT CXXCGCTGCC 2580 

ATCATGGTTC GACCATTGAA CTGCATCGTC GCCGTGTCCC AAAATATGGG GArPGGCAAG 2640 

AACGGAGACC TACCXZTGGCC TCCGCTCAGG AACGAGTTCA AGTACTTCCA AAGAATQACC 2700 

ACAACCTCTT CAGTGQAAOG TAAACAGAAT CTGGTGATCA TGGGTAGGAA AACCTGGTIC 2760 

TCCATTCCTG AGAM3AAXG6 ACCTTTAAAG GACA6AATTA ATATAGTTCT CAGTAGAGAA 2820 

CTCAAAGAAC CACCACGAGG AGCTCATTTT CTTGCCAAAA GTTTGGATGA TGCCTTAAGA 2880 

CTTATTQAAC AACOGGAATT GQCAAGTAAA GTAGACATG6 TTTGGATAGT CGGAGGCAGT 2940 

TCronTACC AGGAAGCCAT GAATCAACCA GGCCACCTCA GACTCTTTGT GACAAGGATC 3000 

ATGCAGGAAT TTGAAAGTGA CACCTnTTC CCAGAAATTG ATITGGGGAA ATATAAACTT 3060 

CTCCCAGAAT ACCCAGGCGT CCTCTCTGAG GTCCAGGAGG AAAAAGGCAT CAAGTATAAG 3120 

TTTGAAGTCT ACGAGAAGAA AGACTAACAG GAAGATGCTT TCAAGTTCTC TGCTCCCCTC 3180 

CTAAAGCTAT GCATmTAT AAGACCATGG GACTITTGCr GGCTTTAGAT CTTTGTGAAG 3240 

GAACCTTACr TCT3TGGT3T GACATAATTG GACAAACTAC CTACAGAGAT TTAAAOCTCT 3300 

AAGGTAAATA TAAAATTTTT AAGTGTATAA TGTGTTAAAC TACT6ATTCT AATTGTTrGT 3360 

GTATTTTAGA TTCCAACCTA TQGAACTGAT GAATGGGAiSC AGTGGTGGAA TGCCTTTAAT 3420 

GAGGAAAACC I ' G ' m ' i ' UC ' l ' C AGAAGAAATG CCATCTAGT6 ATGATGAGGC TACTGCTGAC 3480 

TCTCAACATT CTACTCCTCC AAAAAAQAAQ AGAAAGGTAG AAGACCCCAA GGACTTTCCT 3540 

TCAGAATTGC TAAGmTTT GAGTCATGCT GTGTTTAGTA ATAGAACTCT TGCTTGCTTT 3600 
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OCTATTTACA 


CCACAAAGQA 


>iMMHw^ l\jwi l» l\ii7rAxAUA 


AUAAAAAx Al 


vjuAAAAATAT 


3660 




TCJTtSTAACCT 






ACATACTX3TT 


■A"A"A'1'V:'A"1'ACT 


3720 


5 


CCACACAGGC 




xuw lAx litAx ivi(^Ax\>vrrc 


AAAAATTGTG 


TACCTTTAGC 


3780 




•I*1"I"ITAJ11*1"1' 

A * * A A A A 




xiUUJuiuuAA xATTTuATtjT 


ATAGTGCC^T 


GACTAGAGAT 


3840 








Vi^A'iiflAOGT ^nTACntSCT 


TTAAAAAAiOC 


TCCCACACCT 


3900 


10 






iwUu\aAAX\jC AAa AVl lVA A 


GTTAACITBT 


TTATTQCAGC 


3960 








wwlATiUiwAT wtCAAAA A A\; 


ACAAATAAAG 


CATTTTOTTC 


4020 




ACTGCA.'^TCT 


&IS'I*I^»*I¥JI 


Au AA*i..iwtMU i UATwUixxvTA 


TCT^ATCATG 


TCTGGATCTA 


4080 




ATAAAAfiATA 




X X/tta/ili\.l\>l l> A V7 X XUVS 1 1 X 


TTTBTGTGGA 


GTGCCTCTAT 


4140 


15 


CTGKIAGGCCA. 


VlwX4vk3\J^^ XU 




UAGAATGACr 


CCAAGAGCTA 


4200 








UULAL.l\JuAC AAACAGTGGC 


TGGACTCrGC 


ACCATAACAC 


4260 




fiT firl i ^ ifWI rl 


vMSlAJJUtfXutnU 


U AwiAAA i A A CK^AuCGAAT 


TCCAGCACAC 


TG6CGGC0GT 


4320 


20 






uAATXAiajGG VTTCAmTiGTT 


CATAGCCCAT 


ATA3X3GAGTr 


4380 








AAAAA\JwwUU oCCXwuCTuA 


CUGCCCAAOG 


ACCCCCGCCC 


4440 






MA<MnA\XnWVX 


AAVA'XWwiT AunTAACwCCA 


ATOBGQAcnnr 


TCCAITQAOG 


4500 


25 




Mwwjxnx X XMiw 


MHa A JUUU. AuV» \a«AIUVauw«A 


ijTACATCAAG 


TGTATCATAT 


4560 


GCCAAGTAC6 


CCCCCTATKi 


A WJ*#% AuM ^A^i9AitMJ«x\al9 




ATTAT6CCCA 


4620 




GTACATGACC 


A * X X 


-1 A WW XMV^X X\9 U^^iUlA^aiMAw 


xiUJuAAA AAu 


xxaatjuC xAr 






TACCATGGTG 


A'PGfW5T"T'T*T 

w^wwX XXX 


vlOwtuXJMJM wn*AAwu\yVA3 A 


uViA A AuiJuo r 


TXxaACTLJiLri^ 


4740 


30 


GGGATTTCCA 


'IM X ^ X W^JV^Vtf 


^V«MA A VaMwVj A U«AA\juI.XMAJ A 


rrGnTTGGC 


ACCAAAATCA 


4800 


< 


ACGGQA^VTT 


WiWntVW%X V3r X w 


O A JwUmAAw A\* wIjVvJUWVA lur 


ACajCAAATUG 


GCGCyTAGGGG 


4860 




TGTACGGTSS 


WIM^SX V» X4%XM 


**w<M^«*<MatW» A^iAJtA A AAbrA\ar 


AACUsTvAGA 


TCGCCTtlGAG 


4920 


35 


AG6CCATCCA 


%>WrXwA X X Xw 


Vr A^U% A JWJ AAIaAUUA«la\9 


wCvuATCCA 


GCCTCCXKX3Q 


4960 




COGQGAACGG 




IwlkSWiaUwAX AWL> wiAa Au(.*wAfiU 


AGTQACXSTAA 


GTACCGCCTA 


5040 




TA6A6TCTAT 


AGGCCCACCC 


WWXX^JVWAAw A AaAvUmAU^ 


TATACTGITT 


ITQGCnrrQQG 


5100 




GfTCTATACAC 


W wWWV^W A A WW 


A ^^%A V A l^tiM AunA VVAM 


TAGCi'iASCC 


TATAGGTGTG 


5160 


40 


GGlTATTGAC 


CAI^ATTGAC 


w#»wxwwv.wA^ X x^ux\a#u,AJ/« 


TAwi'A'X'CCAT 


TACTAATwCA 


5220 




TAACATGGCT 


crnxsccACA 


* W A W X X X^% X X W A M X X 


V3W WVi A MWAU 


A \> AVW. A A\»Au 






AGACTGACAC 


GGACTCrrGTA 


A A ^ A^^^ Jl^^ X^^^v^^w X 


>\X 1 XnA AAA A 


A«\jL.AAi\.A A v^A 


C O A o 


45 






CCAGTGCCCG CAfflTITTAT 


TAAACATAAC 


GTGGGATCTC 


5400 




CACGCGAATC 


TOGGGTACGT 


GTTCCGGACA TGGGCTCTTC 


TCCGGTAGCG 


GCGGAGCTTC 


5460 




TACATCOGAG 


CCCTGCTCCC 


ATGCCTCCAG CXSACTCATGG 


TCGCTCGGCA 


GCTCCTTGCT 


5520 




CXrrAACAGTG 


GAGGCCAGAC 


TTAGGCACAG CACGATGCCC 


ACCACCACCA 


GTGTGCCGCA 


5580 


SO 




CAAGGCCGTO 


GCX3GTAGG6T 


ATQTGTCTGA AAATGAGCTC 


GGGGAGCXKSG 


CTTGCACCGC 


5640 




TGACGCATTT 


GGAAGACTTA 


AGGCAGCGGC AGAAGAAGAT 


GCAGGCAGCT 


GAGriTGTTGT 


5700 
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GTTCTGATAA GAGTCAGAGG TAACTCOCGT TSCX3GT6CTG TTAACGGTG6 AGGGCAGTGT 
AGTCTCAGCA GTACTCGTTtS CTGCCXXXSCG CX5CCACCAGA CATAATAGCT GACAGACTAA 
CAGACTOTTC CTTTCCATGG GTCmTCTG CAGTCACCGT CCTTGACACG CGTCTCGGGA 
AGCTTGCCX3C CACCATGGAC TGGACCTGGC GCGTCmTTG CCTGCTCGCC CTGGCTCCTO 
GGGCCCACAG CCAGGTX3CAA CTGOTGCAOT CCGGCGCCQA AGTGAAGAAA CCCGOTQCIT 
CCGTGAAAGT CAGCTGTAAA ACTAGTAGAT ACACCTTCAC TGAATACACC ATACACTGGG 
TTAGACAGGC CCCTGGCCAA AGGCTGGAGT GGATAGGAG6 TATTAATCCT AACAATGGTA 
TTCCTAACTA CAACCAGAAG TTCAAQGGCC GGGCCMCTT GACCGTAGGC AAOTCTGCCA 
OCACCGCCTA CAT3GAACPG TCCAGCCTGC GCTCCGAGGA CACTGCAGTC TACTACTQCG 
CCAGAAGAAG AATOGCCTAT GGTTACGAC3 AGGGCCATGC TATQGACTAC TGGGGTCAAG 
GAACCCTTGT CACCGTCTCC TCAGGTtSAGT GGATCCTCTG CGCCTGGGCX; CAGCTCTGTC 
CCACACCGCG GTCACATGGC ACCACCTCTC TTGCAGCCTC CACCAAGGGC CCATCGGTCT 
TCCCCCTOGC ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 
TCAAGGACTA CTTCCCCGAA CCGGTGAOGG TGT C QTGGAA CTCAGQCGCC CTGACCAGCB 
GCGTCCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG 
TGACCGTOCC CTCCAGCAGC TTGGGCACCC AGACCTACAT CTQCAACGTG AATCACAAGC 
CCAGCAACAC CAAGGTGGAC AAGAAAGTTO AGCCCAAATC TTOT3ACAAA ACTCACACAT 
GCCCyiCCGTC CCCAGCACCT GAACTOCTGG GGGGACCGTC AGTCTTCCTC TTCCCCCCAA 
AACCCAAGGA CACCCTCATG ATCTCCC3GGA CCCCTGAGGT CACATGCGTG GTGGTGGACG 
TGAGCCACGA AGACCCTGAG GTCAAGTTCA ACTGGTACGT GGACGGCGTG GAGGTGCATA 
ATGCCAAGAC AAAGCCGCGG GAGGAGCAGT ACAACAGCAC GTACCGGGTG GTCAGCGTCC 
TCACCGTCCT GCACCAGGAC TGGCTGAATG GCAAGGAGTA CAAGTGCAA6 GTCTCCAACA 
AAGCCCTCCC AGCCCCCATC GAGAAAACCA TCTCCAAAOC CAAAGQOCAG CCCCGAGAAC 
CACAGGTOTA CACCCTGCCC CCATCCCGGG A0GA6ATGAC CAAGAACCAO GTCAGCCTGA 
CCTOCCTGGT CAAAGGCrrC TATCCCAGCG ACATCGCCGT GGAGTGGGAG AGCAATGGGC 
AGCCGGAGAA CAACTACAAG ACCACGCCTC COGTGCPGGA CTCOGACGGC TCCTTCTTCC 
TCTACAGCAA GCTCACCGTG GACAAGAGCA GOTGGCAGCA GGGGAACGTC TTCTCATGCT 
CCGTGATGCA TGAGGCTCT6 CACAACCACT ACACGCAGAA GAGCCTCTCC CTGTCTCCGG 
GTAAATCACT GCXSACGGCCG GCAAGCCCCG CTCCCCGGGC TCTCGCGGTC GCACGAGGAT 
GCTTGGCACG TACCCCCTGT ACATACTTCC CX5GGCGCCCA GCATGGAAAT AAAGCACCGG 
ATCTAATAfiA AGATATTTAT TTTCATTAGA TATGTGTGTT GGTTTTTTGT GTGCAGTGCC 
TCTATCTCGA GGCCAGGTAG GGCTGGCCTT GGGGGAGGiSG GAGGCCAGAA TGACTCCAAG 
AGCTACAGGA AGGCAGGTCA GAGACCCCAC TG6ACAAACA GTGGCTGGAC TCTGCACCAT 
AACACACAAT CAACAGGGGA GTGAGCTGGA AATTTGCTAG CGAATTAATT C 
(2) INFORHATIQN FOR SBQ ID HO: 43: 



5760 

5820 

5880 

5940 

€000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7731 
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(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: 472 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPE: protein 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 43: 

Met Asp Ttp Thr Trp Arg Val Phe Cys Leu Leu Ala Val Ala Pro Gly 
15 10 15 

Ala His Ser Gin Val Gin Leu Val Gin Ser Gly Ala Glu Val Lys Lys 
20 25 30 

Pro Gly Ala Ser Val Lye Val Ser Cys Lys Thr Ser Arg Tyr Thr Phe 
35 40 45 

Thr Glu Tyr Thr lie His Trp Val Arg Gin Ala Pro Gly Gin Arg Leu 
50 55 60 

Glu Trp He Gly Gly He Asn Pro Asn Asn Gly He Pro Asn Tyr Asn 
65 70 75 80 

Gin Lys Phe Lys Gly Arg Ala Thr Leu Thr Val Gly Lys Ser Ala Ser 
85 90 95 

Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val 
100 105 1X0 

Tyr Tyr Cys Ala Arg Arg Arg He Ala Tyr Gly Tyr Asp Glu Gly His 
115 120 125 

Ala Met Asp Tyr Trp Gly Gin Gly Thr Leu Val Thr Val Ser Ser Ser 

130 135 140 

Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr 
145 150 155 160 

Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lye Asp Tyr Phe Pro 
165 170 175 

Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
180 185 190 

His Thr Phe Pro Ala Val Leu Oln Ser Ser Gly Leu Tyr Ser Leu Ser 
195 200 205 

Ser val val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 

210 215 220 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val 

225 230 235 240 

Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala 
245 250 255 

Pro Glu Leu Leu Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro 
260 265 270 

Lys Asp Thr Leu Met He Ser Arg Thr Pro Glu Val Thr Cys Val Val 
275 280 285 

Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr val 
290 295 300 
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Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gin 
305 310 315 320 

Tyr Aen Ser Thr Tyr Arg Val Val Ser Val Leu Ttir Val Leu His Gin 
325 330 335 

Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn Lys Ala 
340 345 350 

Leu Pro Ala Pro He Glu Lys Thr He Ser Lys Ala Lys Gly Gin Pro 
355 360 365 

Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu Het Thr 
370 375 380 

Lys Asn Gin Val Ser Leu Thr Cye Leu Val Lyo Gly Phe Tyr Pro Ser 
385 390 395 *00 

Asp He jaa Val Glu Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr 
405 410 415 

Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr 
420 425 430 

Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe 
435 440 445 

Ser Cys Ser Val !tet His Glu Ala Leu His Asn His Tyr Thr Gin Lys 
450 455 460 

Ser Leu Ser Leu Ser Pro Gly Lys 
25 465 470 

(2) INFORMATION FOR SBQ ID NO: 44: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LBNGTH: 25 base pairs 

(B) TYPB: nucleic acid 
30 (C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 

(ii) HOLBCOLB TYPB: DMA (gencmic) 

35 

(xi) SBQOSMCB DSSCRXPTIM : SEQ ID NO: 44: 
ACCGTCTCCr CAGGTGAGTG GATCC 25 
(2) INFORMATION FOR SBQ ID NO: 45: 

^ Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SBQDENCE DESCRIPTION: SBQ ID NO: 45: 
^ CCTCTCTTGC AGCC 

(2) INFORMATION FOR SEQ ID NO: 46: 
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(i) SSQDBHCB CKARACTBRI5TXCS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 

(ii) MOLECOLB TYPE: DKA (genomic) 



(xi) SEQOENCE DESCRIPTION: SBQ ID NO: 46 
CCTCTCTTGC AGCC 

(2) INFORMATION FOR SEQ ZD NO: 47: 

(i) SBQaBNCB CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLSCOLB TYPE: peptide 



(xi) SEQC7BNCB DESCRIPTION: SBQ ID MO: 47 

Thr Val Ser Ser 

1 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) HOLBCDLB TYPE: peptide 



(xi) SBQDBNCB DESCRIPTION: SEQ ID NO: 48 
Ser Tttr Lys Gly 



(2) INFORMATION FOR SBQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 

(ii) HOLBCDLB TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 
ACCGTCTCCT CAGCCTCCAC CAAGGGC 
(2) INFORMATION FOR SEQ ID NO: 50: 
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(i) SB<yiBKCB CHARACTBRZSTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPB: amino acid 

(C) STRANDBDNESS : single 

(D) TOPOUX3Y: linear 

(ii) MOLBCULB TYPE: peptide 



(xi) SBQOBNCB DBSCRIPTXON: SEQ XD MO: 50: 

Thr Val Ser Ser Ser Tbr Iiys Gly 
1 5 

(2) IMFORMATICai FOR SBQ ID NO: 51: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) I«Btl6TH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 
<D) TOPOLOGY: linear 

20 (ii) MOLBCDLB TYPB: cDNA 

(xi) SBQOBNCB DBSCRIPTXON: SBQ ID HO: 51: 
25 ACCGTCTCCT CAGCCTCCAC CAAGGGC 

(2) INFORMATION FOR SBQ ID NO: 52: 

(i) SEQDBNCB CHARACTERISTICS: 

(A) LBNGTH: 9 amino acids 

(B) TYPB: amino acid 

30 (C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLBCOLB TYPE: peptide 

35 

(xi) SBQOBNCB DBSCRIPTION: SBQ ZD NO: 52: 

Thr Val Ser Ser Ala Ser Thr Lys Gly 
1 5 



(2) INFORMATION FOR SBQ ID IK): S3: 

(i) SBQOBNCB CHARACTBRISTICS: 

(A) LBNGTH: 22 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: DNA (genomic) 



(xi) SBQOBNCB DBSCRIPTION: SEQ ID NO: S3: 
GAAATAAAAC GT6AGTGGAT CC 
(2) INFORMATION FOR SEQ ID NO: 54: 
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(i) SBQOBNCB CHARACTBRISTICS : 

(A) LBNGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOIiOGy: linear 

(ii) M9LBCDZJS TYPE: PMA {genomic) 



(xi) SBQOBRCB DBSCRIPTION: SBQ ID NO: 54 
CTTCTTTCCT CAGGAACTGT OGCTGCA 
(2) INFORMATION FOR SBQ ID NO: S5 : 

(i) SBQOBNCB CHARACTBRISTIC5 : 

(A) X£NGTH: 4 amino acids 

(B) TYPB: amino acid 

' (C) STRANDBDNBSS : single 

(D) TOPOIOSY: linear 

(ii) MOLBCOLB TYPB: peptide 



(xi) SB^JBNCB DBS(3lIPnON: SBQ ID NO: 55 

Thx Val Ala Ala 

1 

(2) INFORMATION FOR SBQ ID NO: 56: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOIOOY: linear 

(ii) MOLBCOLB TYPB: DNA (genomic) 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 56 
GAAATAAAAC GAACTGTGGC T6CA 
(2) INFORMATION FOR SBQ ID NO: 57: 

(i) SBQOBNCB CHARACTBRISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS: Single 

(D) TOPOLOGY: linear 

(ii) MDLSCDLB TYPB: peptide 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 57 

Glu lie Lys Thr Val Ala Ala 
1 5 

(2) INFORMATION FOR SBQ ID NO: 58: 



81 



EP0953639A1 



(1) SEQOBNCB CKARACTBRISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIVDBDNBSS : double 

(D) TOPOIJOCT: linear 

<ii) MDLECDXiB TYPE: DNA (genomic) 



10 (xi) SEQOEHCB DBSCRIPTION: SBQ ID NO: 58: 

GAAATAAAAC GAACTGTGGC TGCA 24 

(2) INFORMATION FOR SBQ ID NO: S9 : 

(i) SBQUBNCB OWRACTBRISTICS : 
15 (A) LENGTH: 6 axtiino acids 

(B) TYPE: amino acid 

(C) STRANDBDNBSS: single ^ 

(D) TOPOLOCnr: linear 



20 



25 



30 



(ii) HOLBCDLB TYPE: peptide 



(xi) SBQOBNCB DESCRIPTION: SEQ ZD NO: 59: 

Glu lie Lys Arg Thr Val Ala Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SBQUBN CB CH ARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPE: peptide 



^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Asp Ser Gin Ala Gin Val Leu Met Leu Leu Leu Leu Tip Val Ser 
15 10 15 



40 



Qly Thr Cys Gly 
20 

(2) INFORMATION FOR SBQ ID NO: 61: 

(i) SBQUBNCB CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SO 



55 



(xi) SBQUBNCB DBSCRIPTION: SBQ ID NO: 61: 

Met Gly Trp Ser Trp Val Phe Leu Phe Leu Leu Ser Gly Thr Ala Gly 
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Val Leu Ser 



(2) IMPORMATIOM FOR SBQ ID MO: 62: 

(i) SBgOBU CB CH ARACTBRISTICS : 
<A) LBHOTH: 9 base pairs 

(B) TTfPB: nucleic acid 

(C) STRANDEDNBSS: double 

(D) TOPOLOGY: linear 

(ii) HDLBCOLB TYPB: DNA (genomic) 



(xi) SBQUBZICS DBSCRIPTZON: SBQ ID NO: 62: 
GCCGCCACC 9 
(2) INFORMATION FOR SBQ ZD NO: 63: 

(i) SBQDBN CB CH ARACTBRISTICS: 

(A) LBM6TH: 37 base pairs 

(B) Typ B* nucleic acid 

(C) STRANDBDNBSS : double 

(D) TOPOLOGY: linear 

(ii) IfiDLBCOLB TYPB: Other nucleic acid 

(A) DBSCRIPTZON: /desc = •PRIMBR* 



(xi) SBQUBNCB DESCRIPTION: SBQ ID NO: 63: 
CAGAAAGCTT GCCGCCACCA TGGATTCACA GGCCCAG 37 
(2) INFORMATION FOR SBQ ID NO: 64: 

(i) SBQUBNCB CHARACTBRISTICS: 

(A) LBNGTH: 6 amino acids 

(B) TYPB: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: peptide 



(xi) SBQDBNCB DESCRIPTION: SBQ ID NO: 64: 

Hat Asp Ser Gin Ala Gin 
1 5 

(2) INFORMATION FOR SBQ ID NO: 65: 

(i) SBQUBNCB CHARACTBRISTICS: 

(A) LBNGTH: 35 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

<A) DESCRIPTION: /desc « "PRIMER" 
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(xi) SBQUEHC3 DBSCRIPTIOii : SBQ ID NO: 65: 
CCGAGGATCC ACTCACGTTT CAGCTCCAGC TTGGT 
(2) INFORMATION FOR SBQ ID NO: 66: 

(1) SBQUBNCB CHARACTBRISnCS : 

(A) LBNOTH: 37 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) HOLBCOLB TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMBR" 



(xi) SBQOKNCB DESCRIPTION: SBQ ID NO: 66: 
CAGAAAGCTT GCCGCCACCA TGGQATGGA0 CTGG6TC 
(2) INFORMATION FOR SBQ ID NO: 67: 

(i) SBQtTBNCB CHARACTBRISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPB: amino acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: peptide 



(xi) SBOOBNCR DESCRIPTION: SBQ ID NO: 67: 
Met Gly Trp ser Trp Val 

1 5 
(2) INFORMATION FOR SBQ ID NO: 68: 

(i) SBQUBNCB CHARACTBRISTICS: 

(A) LBNGTH: 35 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: Other nucleic acid 

(A) DESCRIPTION: /deSC = "PRIMBR- 



(xi) SBQUBNCB DESCRIPTION: SEQ ID NO: 68: 
CCGAGGATCC ACTCACCTGA G6AGACGGT6 ACTGA 
(2) INFORMATION FOR SBQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LBNGTH: 36 base pairs 

(B) TYPB; nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: other nucleic acid 

(A) DESCRIPTION: /deSC « "PRIMER" 
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(xi) SBQtJBRCB DESCRIPTION: SBQ ID NO: 69: 



GTCATCACAA TCTCTCCGOA GGAACCTGGA ACCCAG 



36 
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(2) INFORMATKM FOR SBQ ID NO: 70: 

(i) SBQOBliCB CHARACTBRISTIC3 : 

(A) LENGTH: 29 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : single 

(D) XXJPOLOGY: linear 

(ii) MOLBOniB lYPB: other nucleic acid 

(A> DBSCRIPTION: /desc « -PRIHBR" 



(xi) SEQOBNCB DBSCRIPTION: SBQ ID NO: 70: 
CTCCGGAGAC AnGTGATOA CCCAATCTC 29 
(2) INFORMATION FOR SBQ ID NO: 71: 

(i) SBQDBN CB CH ARACTERISTICS; 

(A) LBNGTH: 29 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOIOGY: linear 

(ii) MOLBCOLS TYPB: other nucleic acid 

(A) DESCRIPTION: /desc « "PRIMER" 



(xi) SEQOBNCB DESCRIPTION: SBQ ID NO: 71: 
CTCCGGAGAC ATTGTGATGA CCCAATCTC 29 
(2) INFORMATION FOR SBQ ID NO: 72: 

(i) SBQOEKCB CHARACTTERISTICS : 

(A) LBHCmi: 72 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLS TYPB: other nucleic acid 

(A) DBSCRIPTION: /desc » "PRIMER" 



(xi) SBQOENCB DBSCRIPTION: SEQ ID NO: 72: 
CAGTCAGAGC CTTTTATATT (TTAGAAATCA AAAGAACTAC TTGGCCTGGT ATCAGCAGAA 60 
ACCAGGACAG CC 72 
(2) INFORMATION FOR SBQ ID NO: 73: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 
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(D) TOPOLOCnf: linear 

(ii) MOLBCOU TYVB: Other nucleic acid 

(A) DBSCRIPTI(»I: /desc = "PRIMBR" 



(Xi) SBQOBNCB DBSCRIPTXON: SBQ ID HO: 13: 
ACCCCAGATT CCCTAGTQCT AGCCCAAM6 ATGAGGAGTT TGGG 
(2) ZNPORMATXON FOR SBQ ZD NO: 74: 

(i) SBQOBNCB CHARACTBRISTICS : 

(A) LBNGTH: 67 base pairs 

(B) TYPB: nucleic acid 

(C) STRAMDBDMBSS: single 

(D) TOPOLOGY: linear 

(ii) MOI^CDLB lYPB: other nucleic acid 

(A) DBSCRIPTIOH: /desc « "PRIMBR- 



(xi) SBQDBHCB DBSCRIPTIOH: SBQ ID NO: 74: 
TAOCACTAGG GAATCTGGG6 TACCTGATAG GTTCAGTGGC AGTGGGTTTG GGACAfiACTT 
CACCCTC 

(2) INPORMATION POR SBQ ID NO: 75: 

(i) SEQOSNCB CHARACTBRISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCnJLB TYPB: Other nucleic acid 

(A) DBSCRIPTIOH: /desc = •PRIHBR" 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 75: 
tfiV CUTlV iX; CX3AACGIGA0 CG(3AXAGCrA AAATATTGCT GACAGTAATA AAC 
(2) INPORMATIOH FOR SEQ ID NO: 76: 

(i) SBQOBNCB CHARACTBRISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) HOLBCOLB TYPB: other nucleic acid 

(A) DESCSavnOR: /desc m "PRIKBR* 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 76: 
GCTCACOTTC GGACAAGGXSA CCAAOCPRSGA AAT 
(2) INFORMATION FOR SEQ ID NO: 77: 
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(i) SEQOBKCB CHARACTBRISTICS : 

(A) LENGTH: 72 base pairs 

(B) TypK: nucleic acid 

(C) STRAMDBDNBSS: single 

(D) TOPOLOGY: linear 

Ui) MOLBCULB TYPB: Other nucleic acid 

(A) DESCRIPTION: /desc » "PRIMBR* 



(xi) SBQOBNCB DBSCRIPTION: SSQ ID NO: 77: 
CAGTCAGAGC CTTXTATATT CTAGAAATCA AAAGAACTAC TTGGCCTGGT TCCAGCAGAA 60 
ACCAGQACAO CC 72 
(2) INFORNATION FOR SBQ ID HO: 78: 



(i) SBQUBNCB CHARACTBRISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDMBSS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: Other nucleic acid 

(A) DESCRIPTION: /desc » "PRIMBR" 



25 

(xi) SBQOBNCB D8SCRIPTXQN: SBQ ID HO: 78: 
GTCCCTTGTC CGAACGTQAG CGGATAGCTA AAATATTGCT GACAGTCATA AACTGCC 57 
<2) INFORMATION FOR SBQ ID NO: 79: 

(i) SEQUENCE CHARACTBRISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPB: nucleic acid 

(C) STRAHDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: Other nucleic acid 

(A) DESCRIPTION: /desc « "PRIMBR" 



(xi) SBQDBNCB DBSCRIPTZCM: SBQ ID NO: 79: 

40 

CCCAAACTCC TCATCTATTG GGCTAGCACT AG6G 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQCrKNCB CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
45 (B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc « "PRIMBR" 

50 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 80: 
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CCCTAGTGCT AGCCCAAXA6 ATGAGGAGTT TGGG 
(2) INFORHAnON FOR SBQ ID NO: 81: 

(i) SBQOBNCB C31ARACTBRISTZCS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOIiOGY: linear 

(ii) NOLBCDLB TYPE: Other nucleic acid 

(A) DESCRIPTION: /deflC » "PRIMER" 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 81: 
TACGCAAACC GCCTCTC 
(2) INFORMATION FOR SBQ ID NO: 82: 

(il SBQaBHCB CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLSCOLB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc - "PRIMER" 



(xi) SBQOBNCB DESCRIPTION: SBQ ID NO: 82: 
GA6TGCACCA TAT6CGGT 
(2) INFORMATION FOR SBQ ID NO: 83: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: IS base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOI^CDLB TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "PRIMER" 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 83: 
AACA8CTAT6 ACCAT6 

(2) INFORMATim FOR SBQ ID NO: 84: 

(i) SBQOKNCB CHARACTERISTICS: 

(A) LBNGTH: 17 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MDIfCDLB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc = -PRIMER" 
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(xi) SBQUENCB DBSCRIPTION: SBQ ZD NO: 84: 
GTlTrCCCAG TCACGAC 

(2) IMFORMATZON FOR SBQ ID NO: 85: 

(i) SBQDKNCB CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 
<D) TOPOUXnr: linear 

(ii) NQXiBOlLB T^B: Other nucleic acid 

(A) DBSCRIPTION: /desc « "PRIHBR" 



(xi) SBQUBNCB DBSCRIPTION: SBQ ID NO: 85: 
GTGTATTCAG T6AA00TQTA TCTACTAOIT TTACAOCTQA CTTTCAC 
(2) INFORMATION FOR SBQ ID NO: 86: 

(i) SBC^TBNCB CHARACTBRISTICS : 

(A) LBN6TH: S3 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

<ii) MOLBCOLB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIlffiR" 



(xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 86: 
TAOTAGATAC ACCTTCACT6 AATACACCAT ACACTGCXSTT AGACAGGCCC CTG 
(2) INFORMATION FOR SBQ ID NO: 87: 

(i) SEQaBNCK CHARACTBRISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPB: other nucleic acid 

(A) DESCRIPTIM: /desc ■ "PRIMER" 



(xi) SBQDBNCB DBSCRXPTI(»i : SBQ ID NO: 87: 
CCCTTGAACT TCTGOTTGTA OTTAGGAATA CCATT GTr AS GATTAATACC TCCTATCCAC 
TCCAGCCTTT G 

(2) INFORMATION FOR SKQ ID NO: 88: 

(i) SBQUENCB CHARACTBRISTICS: 

(A) LENGTH: 71 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPB: other nucleic acid 

(A) DBSCRIPTION: /desc = "PRIMER- 
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(xi) SEQOBNCB DBSCRIPTIW: SBQ ID NO: 88: 
TAACTACAAC CAGAAGTTCA AGGGCCGG6C CACCTTGACC GTAGGCAA6T CTGCCAGCAC 
CGCCTACATQ G 

(2) INFORMATION FOR SBQ ID NO: 89: 

(i) SBQOBNCB CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCDI«B TYPE: Other nucleic acid 

(A) DBSCRIPTIGH: /desc - "PRUJBR" 



(xi) SEQOBNCB DESCRIPTION: SBQ ID NO: 89: 
GCATGGCCCT CGTCOTAACC ATAGGCGATT CTTCTTCTGG CGCAOTAGTA GACTGCAGTG 
TCC 

(2) INFORMATION FOR SBQ ID NO: 90: 

(i) SCQUBNCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) HOLBCDUS TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 



(xi) SBQOENCB DESCRIPTION: SBQ ID NO: 90: 
CTATGGTTAC GACGAGGGCC ATGCTATGGA CTACTGGGGT CAAGGAAC 
(2) INFORMATION FOR SBQ ID NO: 91: 

(i) SEQUBNCB CHARACTERISTICS: 

(A) LENGIH: 71 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDLB TYPB: other nucleic acid 

(A) DESCRIPTION: /desC = "PRIMER" 



(xi) SEQDENCE DESCRIPTION: SBQ ID NO: 91: 
TAACTACAAC CAGAAGTTCA AQGGCCGGGT CACCATCACC GTAGACACCT CTGCCAGCAC 
CGCCTACAT6 G 

(2) INFORMATION FOR SBQ ID NO: 92: 
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(i) SSQOBNCB CHARACTERISTICS : 

(A) LBNGTH: 27 base pairs 

(B) TYPB: nucleic acid 

(C) STRA2IDBDNBSS : single 

(D) TOPOIOGY: linear 

(ii) MOLBCOLB TYPB: Other nucleic acid 

(A) DBSCRIPTION: /deSC - -PRIMER" 



(xi) SBQOBNCB OBSCRIPTION: SBQ ID NO: 92: 
G6ACACTGCA GTCTACTTCT GCGCCAO 27 
(2) INFORMATION FOR SBQ ID NO: 93: 

(i) SBQOBNCB CHARACTBRISTICS : 

(A) LBNGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : Single 

(D) TOPOLOGY : linear 

(ii) MOZiBCOLB TYPB: other nucleic acid 

(A) DBSCRIPTION: /deSC » "PRIMBR" 



<xi) SBQDENCS DBSCRIPTION: SBQ ID NO: 93: 

25 

TACGCAAACC OCCTCTC 17 

(2) INFORMATION FOR SBQ ID NO: 94: 

(i) SBQOBNCB CHARACTBRISTICS : 
(A) LBNQTH: 18 base pairs 
30 (B) TYPB: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOIOGY: linear 

(ii) MOIfiCnLB TYPB: Other nucleic acid 

(A) DBSCRIPTION: /desC m "PRIMER* 



(xi) SBQUBNCB DBSCRIPTION: SBQ ID NO: 94: 
GAGTGCACCA TAT6CGGT 18 
(2) INFORMATION FOR SBQ ID NO: 95: 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDEDNBSS: single 
45 (D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: Other nucleic acid 

(A) DBSCRIPTION: /desc » "PRIMER" 
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(xi) SBQDBNCE DESCRIPTION: SBQ ID NO: 95: 
CCrTTGGCCA GGGGCCT6TC TAACCCAGTG TATGGTGTAT TCAGTGAAGG T6CTATCCAC 60 
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TAsnrccAC tagtit 

(2) INFORMATION FOR SBQ ID NO: 96: 

(i) SBQOBNCB CHARACTSRISTICS : 

(A) LBNOTH: 26 base pairs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: other nucleic acid 

(A) DBSCRIPTION: /desc » *PRIMBR" 



15 (xi) SBQOBNCB DBSCRIPTION: SBQ ID NO: 96: 

GTCACCGTCC TT3ACACGCG TCTCGGOA 28 
(2) INFORMATION FOR SBQ ID NO: 97: 



(i) SBQOBNCB CKARACTBRISTICS : 

(A) LSNGTK: 17 base pairs 

(B) TXPB: nucleic acid 

(C) STRANDBDNBSS: single 
<D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: Other nucleic acid 
25 (A) DBSCRIPTION: /desc » "PRIMER" 



(xi) SBQOBNCB DESCRIPTIW: SBQ ID NO: 97: 

TTGGAGGAGG (3T6CCAO 1'^ 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUBNCB CHARACTERISTICS: 
(A) liENGTH: 22 base pairs 
35 (B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOXOGY: linear 

(ii) MDLBCDIjB TYPB: Other nucleic acid 

(A) DBSCRIPTION: /desc » "PRIMER" 

40 

(xi) SEQUBNCB DESCRIPTION: SBQ ID NO: 98: 
GAGACATTGT GA C C C AATCT CC 22 
(2) INFORMATION FOR SBQ ID NO: 99: 



(i) SBQUBNCB CHARACTERISTICS: 

(A) LENGTH: 25 base ^airs 

(B) TYPB: nucleic acid 

(C) STRANDBDNBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc = "PRIMER" 
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(xi) SEQUENCE DBSCRIPTIOH: SEQ ID NO: 99: 
GACAOTCATA AACTGCCACA TCTTC 25 
(2) INFORMATION FOR SBQ ID NO: 100: 

(i) SEQDBNCB CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONBSS: single 

(D) TOPOLOGY: linear 

(ii) MOLBCDIiB TYPB: other nucleic acid 

(A) DESCRIPTION: /desc « "PRIMER" 



(xi) SEQDBNCB DESCRIPTION: SBQ ID NO: 100: 
TTGACACGCG TCTCGGGAAO CTT 23 
(2) INFORMATION FOR SBQ ID NO: 101: 

(i) SEQDBN CB CH ARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRARDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLBCOLE TYPB: other nucleic acid 

(A) DESCRIPTION: /desc s "PRIMER" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GGCGCAGAGG ATCCACTCAC CT 22 



40 Claims 

1. An antibody protein having the complementary determining regions of the monoclonal antibody F19 (ATCC 
Accession No. HB 8269). said antibody protein specifically binding to f broblast activation protein, characterized in 
that it has framework modifications resulting in the improved producibility in host cells as compared to a chimeric 

45 antibody having the variable regions of F19 and foreign constant regions. 

2. An antibody protein characterised in that it has a variable light chain region and a variable heavy chain region 
according to claim 1 , each joined to a human constant region. 

50 3. The antibody protein of claim 2. wherein said human constant region of the light chain is a human kappa con- 
stant region. 

4. The antibody protein of claim 2. wherein said human constant region of the heavy chain is a human gamma-1 
constant region. 

55 

5. An antibody protein according to any one of claims 1 to 4, characterised in that its expression levels In crude 
media samples as determined by ELISA and/or purified antibody yields exceed the expression levels andJor purifi- 
cation yields of the chimeric antibodies without framework modifications by at least a factor of 10. 



93 



EP0953 639A1 



6. An antibody protein acxjording to any one of claims 1 to 4. characterised in that its expression levels in crude 
media samples as determined by ELISA and/or purified antilx)dy yields exceed the expression levels and/or purifi- 
cation yields of the chimeric antibodies without framework modifications by at least a fcictor of 20. 

5 7. An antibody protein according to any one of claims 1 to 4, characterised in that its expression levels In crude 
media samples as determined by ELISA and/or purified antibody yields exceed the expression levels and/or purifi- 
cation yields of the chimeric antibodies without framework nnodifications by at least a factor of 100. 

8. An antibody protein according to any one of claims 1 to 7, characterised in that it displays improved producibility 
10 in eucaryotic cells. 

9. The antibody protein according to claim 8 wherein said eucaryotic cell is a Chinese hamster ovary cell (CHO cell). 

1 0. An antibody protein according to any one of claims 1 to 9. wherein the amino acid in Kabat position 87 of the 
15 light chain region is not asparaglne. 

11. The antibody protein of claim 10. wherein the amino add in Kabat position 87 of the light chain region is 
selected from aromatic or aliphatic amino acids. 

20 12. The antibody protein of claim 1 1 , wherein said aromatic amino acid in Kabat position 87 of the light chain region 
is a tyrosine or phenylalanine. 

1 3. The antibody protein according to any one of claims 1 to 1 2, wherein the amino acid in Kabat position 36 of the 
light chain region is selected from aromatic amino adds. 

25 

1 4. An antibody protein according to any one of claims 1 to 13 that contains the variable region of the light chain as 
set forth in SEQ ID NO: 2. 

1 5. An antibody protein of claim 1 4 characterised In that the variable region of the light chain is encoded by a nude- 
30 otide sequence as set forth in SEQ ID NO: 1 . 

16. An antibody protein according to any one of claims 1 to 13 that contains the variable region of the light chain as 
set forth in SEQ ID NO: 6. 

35 1 7. An antibody protein of claim 1 6 characterised In that the variable regon of the light chain is encoded by a nude- 
otide sequence as set forth in SEQ ID NO: 5. 

18. An antibody protein according to any one of claims 1 to 1 7 containing a variable region of the heavy chain as 
set forth In any one of SEQ ID NOs: 8. 10. 12. 14. 

40 

19. An antibody protein according to daim 18 characterised in that the variable region of the heavy chain is 
encoded by a nucleotide sequence as set forth in SEQ ID NOs: 7. 9. 1 1 , 13. 

20. An antibody protein according to any one of daims 1 to 14 containing the variable region of the light chain as 
45 set forth in SEQ ID NO: 2 and the variable region of the heavy chain as set forth in SEQ ID NOs: 12. 

21 . The antibody protein of claim 20 characterised in that the variable region of the the light chain is encoded by a 
nucleotide sequence as set forth in SEQ ID NO: 1 and the variable region of the heavy chain is encoded by a nude- 
otide sequence as set forth in SEQ ID NO: 1 1 . 

50 

23. An antibody protein according to any one of daims 1 to 13 containing the variable region of the light chain as 
set forth in SEQ ID NO: 2 and the variable region of the heavy chain as set forth in SEQ ID NOs: 8. 

24. TTie antibody protein of claim 23 characterised in that the variable region of the the light chain is encoded by a 
55 nucleotide sequence as set forth in SEQ ID NO: 1 and the variable region of the heavy chain is encoded by a nude- 

otide sequence as set forth in SEQ ID NO: 7. 

25. A nucleotide sequence encoding an antibody protein according to any one of daims 1 to 24. 
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26. A recombinant DNA vector that contains a nucleotide sequence of claim 25. 

27. The recombinant DNA vector of claim 26. said vector being an expression vector. 
5 28. A host ceil carrying a vector according to daims 26 or 27. 

29. The host cell of claim 28. wherein said host cell is a eucaryotic cell. 

30. The host cell of claim 29. wherein said eucaryotic host cell is a mammalian cell. 

10 

31. The host cell of claim 30, wherein said host cell is a CHO or a COS cell. 

32. A method of producing antibody proteins according to any one of claims 1 to 24, said method comprising the 
steps of: 

IS 

(a) cultivating a host cell according to any one of daims 23 to 26 under conditions where said antibody protein 
is expressed by said host ceil, and 

(b) isolating said antibody protein. 

20 33. The method of claim 32. wherein said host cell is a mammalian cell, preferably a CHO or COS cell. 

34. The method of daim 32 or 33. wherein said host cell is cotransfected with two plasmids canrying the expression 
units for light and heavy chains respectively. 

25 35. An antibody protein according to any one of claims 1 to 24, wherein said antibody protein is conjugated to a 
therapeutic agent. 

36. The antibody protein of daim 35. wherein said therapeutic agent is a therapeutic agent selected from the group 
consisting of radioisotopes, toxins, toxoids, Inflammatory agents and chemotherapeutic agents. 

30 

37. The antibody protein of daim 36, wherein said radioisotopes are p-emitting radioisotopes. 

38. The anti>ody protein of daim 37, wherein said radioisotopes are selected from the group consisting of 
^^Rhenium. ^^reienium, iodine and Yttrium. 

35 

39. An antOaody protein according to any one of claims 1 to 24, characterised in that it is labeled. 

40. The antibody protein of daim 39, wherein said label is a detectable marker. 

40 41 . The antibody protein of claim 40. wherein the detectable marker Is a detectable marker selected from the group 
consisting of enzymes, dyes, radioisotopes, and biotin. 

42. An antibody protein according to any one of claims 1 to 24 conjugated to an imageable agent. 
45 43. The arrtbody protein of daim 42, wherein the imageable agent is a radioisotope. 

44. The antibody protein of daim 43, wherein said radioisotopes are gamma-emitting radioisotopes??. 
45b The antibody protein of daim 44, wherein said radioisotopes is ^^^1. 

so 

46. A pharmaceutical composition containing an antibody protein according to any one of daims 1 to 24 and a 
pharmaceutically acceptable earner useful for treating tumors, wherein said tumors are associated with activated 
stromal fibroblasts. 

55 47. A pharmaceutical composition containing an antibody protein according to any one of claims 35 to 38 and a 
pharmaceutically acceptable carrier useful fbr treating tumors, wherein said tumors are associated with activated 
stromal fibroblasts. 
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48. A pharmaceutical composition corrtaining an antibody protein according to any one of daims 42 to 45 and a 
pharmaceuticaily acceptable carrier useful for imaging the presence of activated stromal fibroblasts in a healing 
wound, inflamed skin or a tumor, In a hunnan patient. 

49. The pharmaceutical composition of clainrts 46 to 48. wherein said tunnors are tumors selected from the cancer 
group consisting of colorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian 
cancers, lung cancers, bladder cancers, pancreatic cancers and metastatic cancers of the brain. 

50. Use of an antibody protein according to anyone of claims 1 to 24 for the treatment of cancer. 

51 . Use of an antibody protein according to anyone of claims 35 to 38 for the treatment of cancer. 

52. Use of an antibody protein according to anyone of claims 42 to 45 for imaging activated activated stromal fibrob- 
lasts. 

53. Use of an antibody protein according to anyone of claims 39 to 41 for detecting the presence of activated stro- 
mal fibroblasts in a sample. 

54. A method of treating tumors, wherein the tumor is associated with activated stromal fibroblasts capable of spe- 
cifically forming a complex with antibody proteins according to any one of claims 1 to 24 or 35 to 38. which com- 
prises contacting the tumor with an anuxjnt of said antibody proteins effective to treat the tumor. 

55. The method of claim 54. wherein the tumor is a tumor having cancer cells selected from the cancer group con- 
sisting of colorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian cancers, 
lung cancers, bladder cancers, pancreatic cancers and metastatic cancers of the brain. 

56. The method of claim 54. wherein the contacting is effected in vitro. 

57. The method of claim 54. wherein the contacting is effected in vivo. 

58. A method of detecting the presence of activated stromal fibroblasts in wound healing, inflammation or a tumor, 
characterised in that 

(a) a sample, possibly containing activated stromal fibroblasts, is contacted with an antibody protein according 
to any one of claims 1 to 24 or 39 to 41 under conditions suitable for tiie formation of a complex between said 
antibody and antigen. 

(b) detecting the presence of said complex, tiiereby detecting the presence of activated stromal fibroblasts in 
wound healing, inflammation or a tumor. 

59. The method of daim 58, wherein the tumor is a tumor having cancer cells selected from the cancer group con- 
sisting of cotorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian cancers, 
lung cancers, bladder cancers, pancreatic cancers and metastatic cancers of the brain. 

60. The method of daim 58 or 59. wherein the antibody protein is a protein according to any one of daims 39 to 41 . 

61 . A method of imaging the presence of activated stromal fibroblasts in a healing wound, inflamed skin or a tumor, 
in a human patient, characterised in tiiat 

(a) an antibody protein according to any one of daims 1 to 24 conjugated to an imageable agent is adminis- 
tered to a human patient under conditions suitable for the formation of an antibody-antigen complex, 

(b) imaging any complex formed in this manner, 

(c) thereby imaging the presence of activated stromal fibroblasts in a human patient. 

62. The method of daim 61 , wherein the tumor is a tumor having cancer cells selected from the cancer group con- 
sisting of cotorectal cancers, non-small cell lung cancers, breast cancers, head and neck cancer, ovarian cancers, 
lung cancers, bladder cancers, pancreatic cancers and m^astatic cancers of the brain. 

63. A method of detecting tumor-stroma, characterised in that 
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(a) a suitable sample is contacted with an antibody protein according to any one of claims 1 to 24. under con- 
ditions suitable for the formation of an antibody-antigen complex. 

(b) detecting the presence of any complex so formed. 

(c) relating the presence of said complex to the presence of tumor-stroma. 

64. The method of claim 62. wherein said antibody is labelled with a detectable marker. 

65. A method of imaging tumor-stroma in a human patient, which comprises 

(a) adminstering to the patient an antibody protein according to any one of claims 42 to 45, under conditions 
suitable for the formation of an antibody-antigen complex, 

(b) imaging any complex so formed, and thereby imaging the presence of tumor-stroma in a human patient. 
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Fig.1 
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Fig. 2 

1 11 21 31 41 

DIVMTQSPDS LAVSLGERAT INCKSSQSLL YSRNQKNYLA WYQQKPGQPP 
51 61 71 81 91 

KLLIFWASTR ESGVPDRFSG SGFGTDFTLT ISSLQAEDVA VYYCQQYFSY 
101 111 
PLTFGQGTKV EIK 



Fig. 3 

1 11 21 31 41 

GACATTGTGA TGACCCAATC TCCAGACTCT TTGGCTGTGT CTCTAGGGGA 
51 61 71 81 91 

GAGGGCCACC ATCAACTGCA AGTCCAGTCA GAGCCTTTTA TATTCTAGAA 
101 111 121 131 141 

ATCAAAAGAA CTACTTGGCC TGGTTCCAGC AGAAACCAGG ACAGCCACCC 
151 161 171 181 191 

AAACTCCTCA TCTTTTGGGC TAGCACTAGG GAATCTGGGG TACCTGATAG 
201 211 221 231 241 

GTTCAGTGGC AGTGGGTTTG GGACAGACTT CACCCTCACC ATTAGCAGCC 
251 261 271 281 291 

TGCAGGCTGA AGATGTGGCA GTTTATGACT GTCAACAATA TTTTAGCTAT 
301 311 321 ~ 331 339 

CCGCTCACGT TCGGACAAGG GACCAAGGTG GAAATAAAA 
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Fig. 4 
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81 



91 



51 61 71 

KLLIFWASTR ESGVPDRFSG SGFGTDFTLT ISSLQAEDVA VYDCQQYFSY 
101 111 
PLTFGQGTKV EIK 



Fig. 5 

1 11 21 31 41 

GACATTGTGA TGACCCAATC TCCAGACTCT TTGGCTGTGT CTCTAGGGGA 
51 61 71 81 91 

GAGGGCCACC ATCAACTGCA AGTCCAGTCA GAGCCTTTTA TATTCTAGAA 
101 111 121 131 141 

ATCAAAAGAA CTACTTGGCC TGGTATCAGC AGAAACCAGG ACAGCCACCC 
151 161 171 181 191 

AAACTCCTCA TCTATTGGGC TAGCACTAGG GAATCTGGGG TACCTGATAG 
201 211 221 231 241 

GTTCAGTGGC AGTGGGTTTG GGACAGACTT CACCCTCACC ATTAGCAGCC 
251 261 271 281 291 

TGCAGGCTGA AGATGTGGCA GTTTATTACT GTCAGCAATA TTTTAGCTAT 
301 311 321 331 339 

CCGCTCACGT TCGGACAAGG GACCAAGGTG GAAATAAAA 



Rg.6 

1 11 21 31 41 

DIVMTQSPDS LAVSLGERAT INCKSSQSLL YSRNQKNYLA WYQQKPGQPP 
51 61 71 81 91 

KLLIYWASTR ESGVPDRFSG SGFGTDFTLT ISSLQAEDVA VYYCQQYFSY 
101 ~ 111 
PLTFGQGTKV EIK 
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Fig. 7 



1 

CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC 
51 

CGTGAAAGTC AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA 
101 

TACACTGGGT TAGACAGGCC CCTGGCCAAA GGCTGGAGTG GATAGGAGGT 

151 

ATTAATCCTA ACAATGGTAT TCCTAACTAC AACCAGAAGT TCAAGGGCCG 
201 

GGCCACCTTG ACCGTAGGCA AGTCTGCCAG CACCGCCTAC ATGGAACTGT 
251 

CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTACTGCGC CAGAAG7AGA 
301 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG 
351 372 
AACCCTTGTC ACCGTCTCCT CA 



Rg.8 



1 11 21 31 41 

QVQLVQSGAE VKKPGASVKV SCKTSRYTFT EYTIHWVRQA PGQRLEWIGG 
51 61 71 81 91 

INPNNGIPNY NQKFKGRATL TVGKSASTAY MELSSLRSED TAVYYCARRR 
101 111 121-124 

lAYGYDEGHA MDYWGQGTLV TVSS 



Fig. 9 



1 

CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC 
51 

CGTGA7VAGTC AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA 
101 

TACACTGGGT TAGACAGGCC CCTGGCCAAA GGCTGGAGTG GATAGGAGGT 
151 

ATTAATCCTA ACAATGGTAT TCCTAACTAC AACCAGAAGT TCAAGGGCCG 
201 

GGCCACCTTG ACCGTAGGCA AGTCTGCCAG CACCGCCTAC ATGGAACTGT 
251 

CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTTCTGCGC CAGAAGAAGA 
301 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG 
351 372 
AACCCTTGTC ACCGTCTCCT CA 
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Fig. 10 



1 11 21 31 41 

QVQLVQSGAE VKKPGASVKV SCKTSRYTFT EYTIHWVRQA PGQRLEWIGG 
51 61 71 81 91 

INPNNGIPNY NQKFKGRATL TVGKSASTAY MELSSLRSED TAVYFCARRR 
101 111 121-124 

lAYGYDEGHA MDYWGQGTLV TVSS 



Fig. 11 



1 

CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC 

51 

CGTGAAAGTC AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA 
101 

TACACTGGGT TAGACAGGCC CCTGGCCAAA GGCTGGAGTG GATAGGAGGT 
151 

ATTAATCCTA ACAATGGTAT TCCTAACTAC AACCAGAAGT TCAAGGGCCG 
201 

GGTCACCATC ACCGTAGACA CCTCTGCCAG CACCGCCTAC ATGGAACTGT 
251 

CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTACTGCGC CAGAAGAAGA 
301 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG 
351 372 
AACCCTTGTC ACCGTCTCCT CA 



Fig. 12 



1 11 21 31 41 

QVQLVQSGAE VKKPGASVKV SCKTSRYTFT EYTIHWVRQA PGQRLEWIGG 
51 61 71 81 91 

INPNNGIPNY NQKFKGRVTI TVDTSASTAY MELSSLRSED TAVYYCARRR 
101 111 121-124 

lAYGYDEGHA MDYWGQGTLV TVSS 
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Fig. 13 

1 

CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC 

51 

CGTGAAAGTC AGCTGTAAAA CTAGTAGATA CACCTTCACT GAATACACCA 
101 

TACACTGGGT TAGACAGGCC CCTGGCCAAA GGCTGGAGTG GATAGGAGGT 
151 

ATTAATCCTA ACAATGGTAT TCCTAACTAC AACCAGAAGT TCAAGGGCCG 
201 

GGTCACCATC ACCGTAGACA CCTCTGCCAG CACCGCCTAC ATGGAACTGT 
251 

CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTTCTGCGC CAGAAGAAGA 
301 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG 
351 372 
AACCCTTGTC ACCGTCTCCT CA 



Rg. 14 

1 11 21 31 41 

QVQLVQSGAE VKKPGASVKV SCKTSRYTFT EYTIHWVRQA PGQRLEWIGG 
51 61 71 81 91 

INPNNGIPNY NQKFKGRVTI TVDTSASTAY MELSSLRSED TAVYFCARRR 
101 111 121-124 

lAYGYDEGHA MDYWGQGTLV TVSS 



Fig. 15 

1 

CAGGTGCAAC TAGTGCAGTC CGGCGCCGAA GTGAAGAAAC CCGGTGCTTC 
51 

CGTGAAAGTC AGCTGTAAAA CTAGTGGATA CACCTTCACT GAATACACCA 
101 

TACACTGGGT TAGACAGGCC CCTGGCCAAA GGCTGGAGTG GATAGGAGGT 
151 

ATTAATCCTA ACAATGGTAT TCCTAACTAC AACCAGAAGT TCAAGGGCCG 
201 

GGTCACCATC ACCGTAGACA CCTCTGCCAG CACCGCCTAC ATGGAACTGT 
251 

CCAGCCTGCG CTCCGAGGAC ACTGCAGTCT ACTACTGCGC CAGAAGAAGA 
301 

ATCGCCTATG GTTACGACGA GGGCCATGCT ATGGACTACT GGGGTCAAGG 
351 372 
AACCCTTGTC ACCGTCTCCT CA 
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Fig. 16 



1 11 21 31 41 

QVQLVQSGAE VKKPGASVKV SCKTSGYTFT EYTIHWVRQA PGQRLEWIGG 
51 61 71 ~ 81 91 

INPNNGIPNY NQKFKGRVTI TVDTSASTAY MELSSLRSED TAVYYCARRR 
101 111 " 121-124 

lAYGYDEGHA MDYWGQGTLV TVSS 



Fig. 17 



1 

DIVMSQSPSS LAVSVGEKVT MSCKSSQSLL YSRNQKNYLA WFQQKPGQSP 

51 

KLLIFWASTR ESGVPDRFTG SGFGTDFNLT ISSVQAEDLA VYDCQQYFSY 
101 

PLTFGAGTKL ELKRTVAAPS VFIFPPSDEQ LKSGTASWC LLNNFYPREA 
151 

KVQWKVDNAL QSGNSQESVT EQDSKDSTYS LSSTLTLSKA DYEKHKVYAC 
201 

EVTHQGLSSP VTKSFNRGEC 



Rg. 18 



1 

VQLQQSGPEL VKPGASVKMS CKTSRYTFTE YTIHWVRQSH GKSLEWIGGI 
51 

NPNNGIPNYN QKFKGRATLT VGKSSSTAYM ELRSLTSEDS AVYFCARRRI 
101 

AYGYDEGHAM DYWGQGTSVT VSSASTKGPS VFPLAPSSKS TSGGTAALGC 
151 

LVKDYFPEPV TVSWNSGALT SGVHTFPAVL QSSGLYSLSS WTVPSSSLG 
201 

TQTYICNVNH KPSNTKVDKK VEPKSCDKTH TCPPCPAPEL LGGPSVFLFP 
251 

PKPKDTLMIS RTPEVTCVW DVSHEDPEVK FNWYVDGVEV HNAKTKPREE 

301 

QYNSTYRWS VLTVLHQDWL NGKEYKCKVS NF^ALPAPIEK TISKAKGQPR 
351 

EPQVYTLPPS REEMTKNQVS LTCLVKGFYP SDIAVEWESN GQPENNYKTT 
401 

PPVLDSDGSF FLYSKLTVDK SRWQQGNVFS CSVMHEALHN HYTQKSLSLS 

451 
PGK 
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Fig. 19 
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Fig. 20 



114 124 134 144 154 

RTVAAPSVFI FPPSDEQLKS GTASWCLLN NFYPREAKVQ WKVDNALQSG 

164 174 184 194 204 

NSQESVTEQD SKDSTYSLSS TLTLSKADYE KHKVYACEVT HQGLSSPVTK 
214-220 
SFNRGEC 
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Fig. 21 

373 

GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG 
423 

CACCTCTGGG GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC 
473 

CCGAACCGGT GACGGTGTCG TGGAACTCAG GCGCCCTGAC CAGCGGCGTG 
523 

CACACCTTCC CGGCTGTCCT ACAGTCCTCA GGACTCTACT CCCTCAGCAG 
573 

CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC TACATCTGCA 
623 

ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGTTGAGCCC 
673 

AAATCTTGTG ACAAT^CTCA CACATGCCCA CCGTGCCCAG CACCTGAACT 
723 

CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC 
773 

TCATGATCTC CCGGACCCCT GAGGTCACAT GCGTGGTGGT GGACGTGAGC 
823 

CACGAAGACC CTGAGGTCAA GTTCAACTGG TACGTGGACG GCGTGGAGGT 

873 

GCATAATGCC 7WVGACAAAGC CGCGGGAGGA GCAGTACAAC AGCACGTACC 
923 

GGGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 
973 

GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA 

1023 

AACCATCTCC AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC 
1073 

TGCCCCCATC CCGGGAGGAG ATGACCAAGA ACCAGGTCAG CCTGACCTGC 
1123 

CTGGTCAAAG GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA 
1173 

TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG CTGGACTCCG 
1223 

ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 

1273 

CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA 
1323 1362 
CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA 
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Fig. 22 

125 

ASTKGPSVFP LAPSSKSTSG GTAALGCLVK DYFPEPVTVS WNSGALTSGV 

175 

HTFPAVLQSS GLYSLSSWT VPSSSLGTQT YICNVNHKPS NTKVDKKVEP 

225 

KSCDKTHTCP PCPAPELLGG PSVFLFPPKP KDTLMISRTP EVTCVWDVS 
275 

HEDPEVKFNW YVDGVEVHNA KTKPREEQYN STYRWSVLT VLHQDWLNGK 
325 

EYKCKVSNKA LPAPIEKTIS KAKGQPREPQ VYTLPPSREE MTKNQVSLTC 

375 

LVKGFYPSDI AVEWESNGQP ENNYKTTPPV LDSDGSFFLY SKLTVDKSRW 
425 454 
\ QQGNVFSCSV MHEALHNHYT QKSLSLSPGK 
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Fig. 23A 
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Fig. 23B 
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Fig. 24 



Hindi 1 1 

aagcttGCCGCCACCatggattcacaggcccaggttcttatgttactgccgctatgggta 

1 + + + + + + 

ttcgaaCGGCGGTGGtacctaagtgtccgggtccaagaatacaatgacggcgatacccat 

Kozak sequence 

MDSQAQVLMLLPLWV 

tctggtacctgtggggacattgtgatgtcacagtctccatcctccctagctgtgtcagtt 

61 + + + + + + 

agaccatggacacccctgtaacactacagtgtcagaggtaggagggatcgacacagtcaa 

SGTCGDIVMSQS PSSLAVSV 

ggagagaaggttactatgagctgcaagtccagtcagagccttttatatagtcgtaatcaa 

121 + + + + + 

cctctcttccaatgatactcgacgttcaggtcagtctcggaaaatatatcagcattagtt 

GEKVTMSC K S S Q S L L Y S R N Q 

CDR 1 

aagaactacttggcctggttccagcagaagccagggcagtctcctaaactgctgattttc 

181 + + + + + + 

ttcttgatgaaccggaccaaggtcgtcttcggtcccgtcagaggatttgacgactaaaag 

K N Y L A WFQQKPGQSPKLLI F 

tgggcatccactagggaatctggggtccctgatcgcttcacaggcagtggatttgggacg 

241 + + + + + + 

acccgtaggtgatcccttagaccccagggactagcgaagtgtccgtcacctaaaccctgc 

W A S T R E $ GVPDRFTGSGFGT 
CDR 2 

gatttcaatctcaccatcagcagtgtgcaggctgaggacctggcagtttatgactgtcag 

301 + + + + + + 

ctaaagttagagtggtagtcgtcacacgtccgactcctggaccgtcaaatactgacagtc 

DFNLTI SSVQAEDLAVYDCO 

caatattttagctatccgctcacgttcggtgctgggaccaagctggagctgaAACGTGAG 

361 + + + + + + 

gttataaaatcgataggcgagtgcaagccacgaccctggttcgacctcgactTTGCACTG 

splice donor site 

QYFSYPLT FGAGTKLELK 
CDR 3 

BamHI 
Tggatcc 

421 427 

Acctagg 
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Fig. 25 



Hindi I I 

AAOCTTGCCGCCACCATGGGATGGAGCTGGGTCTTTCTCTTTCTCCTGTCAGGAACTGCA 

1 + + + + + + 

TTCGAA CGGCGGTGG TACCCTACCTCGACCCAGAAAGAGAAAGAGGACAGTCCTTGACGT 
Kozak sequence 

MGWSWVFLFLLSGTA 

GGTGTCCTCTCTGAGGTCCAGCTGCAACAGTCTGGACCTGAGCTGGTGAAGCCTGGGGCT 

61 + + + + + 

CCACAGGAGAGACTCCAGGTCGACGTTGTCAGACCTGGACTCGACCACTTCGGACCCCGA 

GVLSEVQLQQSGPELVKPGA 

TCAGTAAAGATGTCCTGCAAGACTTCTAGATACACATTCACTGAATACACCATACACTGG 

121 + + + + + + 

AGTCATTTCTACAGGACGTTCTGAAGATCTATGTGTAAGTGACTTATGTGGTATGTGACC 

SVKMSCKTSRYTFT E Y T I H W 

CDR 1 

GTGAGACAGAGCCATGGAAAGAGCCTTGAGTGGATTGGAGGTATTAATCCTAACAATGGT 

181 + + + + + + 

CACTCTGTCTCGGTACCTTTCTCGGAACTCACCTAACCTCCATAATTAGGATTGTTACCA 

VRQSHGKSLEWIG G I N P N N G 

CDR 2 

ATTCCTAACTACAACCAGAAGTTCAAGGGCAGGGCCACATTGACTGTAGGCAAGTCCTCC 

241 + + + + + + 

TAAGGATTGATGTTAGTCTTCAAGTTCCCGTCCCGGTGTAACTGACATCCGTTCAGGAGG 

IPNYNQKFKG RATLTVGKSS 



AGCACCGCCTACATGGAGCTCCGCAGCCTGACATCTGAGGATTCTGCGGTCTATTTCTGT 

301 + + + + + + 

TCGTGGCGGATGTACCTCGAGGCGTCGGACTGTAGACTCCTAAGACGCCAGATAAAGACA 

STAYMELRSLTSEDSAVYFC 

GCAAGAAGAAGAATCGCCTATGGTTACGACGAGGGCCATGCTATGGACTACTGGGGTCAA 
361 + + + + + + 

CGTTCTTCTTCTTAGCGGATACCAATGCTGCTCCCGGTACGATACCTGATGACCCCAGTT 

A R RRIAYGYDEGHAMDY W G Q 

CDR 3 

BamHI 

GGAACCTCAGTCACCGTCTCCTCAGGTGAGTGGATCC 

421 + + + 468 

CCTTGGAGTCAGTGGCAGAGGAGTCCACTCACCTAGG 

splice donor site 
GTSVTVSS 
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Fig. 26 /I 

Spe I 

1 gaattccagc acactggcgg ccqtt ACTAG T TATTAATAG TAATCAATTA 

51 CGGGGTCATT AGTTCATAGC CCATATATGG AGTTCCGCGT TACATAACTT 

101 ACGGTAAATG GCCCGCCTGG CTGACCGCCC AACGACCCCC GCCCATTGAC 

151 GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT 

201 GACGTCAATG GGTGGAGTAT TTACGGTAAA CTGCCCACTT GGCAGTACAT 

251 CAAGTGTATC ATATGCCAAG TACGCCCCCT ATTGACGTCA ATGACGGTAA 

301 ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACTTTCCTA 

SnaB I 

351 CTTGGCAGTA CAT CTACGTA TTAGTCATCG CTATTACCAT GGTGATGCGG 

401 TTTTGGCAGT ACATCAATGG GCGTGGATAG CGGTTTGACT CACGGGGATT 

451 TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA 

501 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA 

551 ATGGGCGGTA GGCGTGTACG GTGGGAGGTC TATATAAGCA GAGCTCGTTT 

601 AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT TTTGACCTCC 

Sac II 

651 ATAGAAGACA CCGGGACCGA TCCAGCCT CC GCGGC CGGGA ACGGTGCATT 

701 GGAACGCGGA TTCCCCGTGC CAAGAGTGAC GTAAGTACCG CCTATAGAGT 

751 CTATAGGCCC ACCCCCTTGG CTTCTTATGC ATGCTATACT GTTTTTGGCT 

801 TGGGGTCTAT ACACCCCCGC TTCCTCATGT TATAGGTGAT GGTATAGCTT 

851 AGCCTATAGG TGTGGGTTAT TGACCATTAT TGACCACTCC CCTATTGGTG 

901 ACGATACTTT CCATTACTAA TCCATAACAT GGCTCTTTGC CACAACTCTC 

951 TTTATTGGCT ATATGCCAAT ACACTGTCCT TCAGAGACTG ACACGGACTC 

1001 TGTATTTTTA CAGGATGGGG TCTCATTTAT TATTTACAAA TTCACATATA 

1051 CAACACCACC GTCCCCAGTG CCCGCAGTTT TTATTAAACA TAACGTGGGA 

BspE I 

1101 TCTCCACGCG AATCTCGGGT ACGTGT TCCG GA CATGGGCT CTTCTCCGGT 
1151 AGCGGCGGAG CTTCTACATC CGAGCCCTGC TCCCATGCCT CCAGCGACTC 
1201 ATGGTCGCTC GGCAGCTCCT TGCTCCTAAC AGTGGAGGCC AGACTTAGGC 
1251 ACAGCACGAT GCCCACCACC ACCAGTGTGC CGCACAAGGC CGTGGCGGTA 
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Fig. 26/3 

2351 TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG CGAAGTCACC 

LSKA DYE KHK VYAC EVT 

2401 CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA GGGGAGAGTG 
HQG LSSP VTK SFN RGEC 

2451 TTAGAGGGAG AAGTGCCCCC ACCTGCTCCT CAGTTCCAGC CTGACCCCCT 

2501 CCCATCCTTT GGCCTCTGAC CCTTTTTCCA CAGGGGACCT ACCCCTATTG 

2551 CGGTCCTCCA GCTCATCTTT CACCTCACCC CCCTCCTCCT CCTTGGCTTT 

2601 AATTATGCTA ATGTTGGAGG AGAATGAATA AATAAAGTGA ATCTTTGCAC 

2651 CTGTGGTGGA TCTAATAAAA GATATTTATT TTCATTAGAT ATGTGTGTTG 

2701 GTTTTTTGTG TGCAGTGCCT CTATCTGGAG GCCAGGTAGG GCTGGCCTTG 

2751 GGGGAGGGGG AGGCCAGAAT GACTCCAAGA GCTACAGGAA GGCAGGTCAG 

2801 AGACCCCACT GGACAAACAG TGGCTGGACT CTGCACCATA ACACACAATC 

2851 AACAGGGGAG TGAGCTGGAA ATTTGCTAGC GAATTCTTGA AGACGAAAGG 

2901 GCCTCGTGAT ACGCCTATTT TTATAGGTTA ATGTCATGAT AATAATGGTT 

2951 TCTTAGACGT CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT 

3001 TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT 

3051 AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT 

3101 CAACATtTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC 

3151 TGTTTTTGCT CACCCAGAAA CGCTGGTGAA AGTAA7UVGAT GCTGAAGATC 

3201 AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA CAGCGGTAAG 

3251 ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT 

3301 TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTGTTGAC GCCGGGCAAG 

3351 AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 

3401 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAG TAAGAGAATT 

3451 ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC 
Pvu I 

3501 TGACA ACGAT CGG AGGACCG AAGGAGCTAA CCGCTTTTTT GCACAACATG 

3551 GGGGATCATG TAACTCGCCT TGATCGTTGG GAACCGGAGC TGAATGAAGC 

3601 CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGCAGCA ATGGCAACAA 
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Fig. 26/4 
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CATTTTTAAT 
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CTAGGTGAAG 


ATCCTTTTTG 


ATAATCTCAT 


4051 


GACCAAAATC 


CCTTAACGTG 


AGTTTTCGTT 


CCACTGAGCG 


TCAGACCCCG 


4101 


TAGAAAAGAT 


CAAAGGATCT 


TCTTGAGATC 


CTTTTTTTCT 


GCGCGTAATC 


4151 


TGCTGCTTGC 


AAACAAAAAA 


ACCACCGCTA 


CCAGCGGTGG 


TTTGTTTGCC 


4201 


GGATCAAGAG 


CTACCAACTC 


TTTTTCCGAA 


GGTAACTGGC 


TTCAGCAGAG 


4251 


CGCAGATACC 


AAATACTGTC 


CTTCTAGTGT 


AGCCGTAGTT 


AGGCCACCAC 


4301 


TTCAAGAACT 


CTGTAGCACC 


GCCTACATAC 


CTCGCTCTGC 


TAATCCTGTT 


4351 


ACCAGTGGCT 


GCTGCCAGTG 


GCGATAAGTC 


GTGTCTTACC 


GGGTTGGACT 


4401 


CAAGACGATA 


GTTACCGGAT 


AAGGCGCAGC 


GGTCGGGCTG 


AACGGGGGGT 


4451 


TCGTGCACAC 


AGCCCAGCTT 


GGAGCGAACG 


ACCTACACCG 


AACTGAGATA 


4501 


CCTACAGCGT 


GAGCTATGAG 


AAAGCGCCAC 


GCTTCCCGAA 


GGGAGAAAGG 


4551 


CGGACAGGTA 


TCCGGTAAGC 


GGCAGGGTCG 


GAACAGGAGA 


GCGCACGAGG 


4601 


GAGCTTCCAG 


GGGGAAACGC 


CTGGTATCTT 


TATAGTCCTG 


TCGGGTTTCG 


4651 


CCACCTCTGA 


CTTGAGCGTC 


GATTTTTGTG 


ATGCTCGTCA 


GGGGGGCGGA 


4701 


GCCTATGGAA 


AAACGCCAGC 


AACGCGGCCT 


TTTTACGGTT 


CCTGGCCTTT 



BspLUllI 

4751 TGCTGGCCTT TTGCTCACAT GTTCTTTCCT GCGTTATCCC CTGATTCTGT 



4801 GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC 
4851 GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG 
4901 ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC ACCGCATATG 
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Fig. 26/5 

Bstll07l 

4951 GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCC AGTATA 
5001 CACTCCGCTA TCGCTACGTG ACTGGGTCAT GGCTGCGCCC CGACACCCGC 
5051 CAACACCCGC TGACGCGCCC TGACGGGCTT GTCTGCTCCC GGCATCCGCT 
5101 TACAGACAAG CTGTGACCGT CTCCGGGAGC TGCATGTGTC AGAGGTTTTC 
5151 ACCGTCATCA CCGAAACGCG CGAGGCAGCT GTGGAATGTG TGTCAGTTAG 
5201 GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 
5251 CATCTCAATT AGTCAGCAAC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA 
5301 GCATGCATCT CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC 

5351 ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC CCCATGGCTG 

Sfi I 

5401 ACTAATTTTT TTTATTTATG CAGAGGCCGA GGCCGCCTCG GCC TCTGAGC 

Stu I/Avr II 

5451 TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG CTTTTGCAAA 

5501 AAGCTAGCTT CACGCTGCCG CAAGCACTCA GGGCGCAAGG GCTGCTAAAG 

5551 GAAGCGGAAC ACGTAGAAAG CCAGTCCGCA GAAACGGTGC TGACCCCGGA 

5601 TGAATGTCAG CTACTGGGCT ATCTGGACAA GGGAAAACGC AAGCGCAAAG 

5651 AGAT^GCAGG TAGCTTGCAG TGGGCTTACA TGGCGATAGC TAGACTGGGC 

5701 GGTTTTATGG ACAGCAAGCG TUICCGGAATT GCCAGCTGGG GCGCCCTCTG 

5751 GTAAGGTTGG GAAGCCCTGC AAAGTAT^CT GGATGGCTTT CTTGCCGCCA 

Bgl II/Bcl I 

5801 AGGATCTGAT GGCGCAGGGG ATCA AGATCT GATCAA GAGA CAGGATGAGG 

5851 ATCGTTTCGC ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG 

5901 CTTGGGTGGA GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC 

5951 TGCTCTGATG CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT 

6001 TTTTGTCAAG ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG 

Msg I 

6051 CAGCGCGGCT ATCGTGG CTG GCCA CGACGG GCGTTCCTTG CGCAGCTGTG 
6101 CTCGACGTTG TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT 
6151 GCCGGGGCAG GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT 
6201 CCATCATGGC TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC 
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6251 TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG 

6301 GATGGAAGCC GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG 

6351 GGCTCGCGCC AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC 

6401 GGCGAGGATC TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT 

6451 GGTGG7\AAAT GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG 
Rsr II 

6501 TGG CGGACCG CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA 

6551 GAGCTTGGCG GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC 

6601 CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT 

Nsp V 

6651 TCTGAGCGGG ACTCTGGGG T TCGAAA TGAC CGACCAAGCG ACGCCCAACC 

6701 TGCCATCACG AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC 

6751 TTCGGAATCG TTTTCCGGGA CGCCGGCTGG ATGATCCTCC AGCGCGGGGA 

Sma I Nru I 

6801 TCTCATGCTG GAGTTCTTCG CCCAC CCCGG GC TCGATCCC C TCGCGA GTT 

6851 GGTTCAGCTG CTGCCTGAGG CTGGACGACC TCGCGGAGTT CTACCGGCAG 

6901 TGCAT^TCCG TCGGCATCCA GGAAACCAGC AGCGGCTATC CGCGCATCCA 

6951 TGCCCCCGAA CTGCAGGAGT GGGGAGGCAC GATGGCCGCT TTGGTCCCGG 

7001 ATCTTTGTGA AGGAACCTTA CTTCTGTGGT GTGACATAAT TGGACAAACT 

7051 ACCTACAGAG ATTTAAAGCT CTAAGGTAAA TATAAAATTT TTAAGTGTAT 

7101 AATGTGTTAA ACTACTGATT CTAATTGTTT GTGTATTTTA GATTCCAACC 

7151 TATGGAACTG ATGAATGGGA GCAGTGGTGG AATGCCTTTA ATGAGGAAAA 

7201 CCTGTTTTGC TCAGAAGAAA TGCCATCTAG TGATGATGAG GCTACTGCTG 

7251 ACTCTCAACA TTCTACTCCT CCAAAAAAGA AGAGAAAGGT AGAAGACCCC 

7301 AAGGACTTTC CTTCAGAATT GCTAAGTTTT TTGAGTCATG CTGTGTTTAG 

7351 TAATAGAACT CTTGCTTGCT TTGCTATTTA CACCACAAAG GAAAAAGCTG 

7401 CACTGCTATA CAAGAAAATT ATGGAAAAAT ATTCTGTAAC CTTTATAAGT 

7451 AGGCATAACA GTTATAATCA TAACATACTG TTTTTTCTTA CTCCACACAG 

7501 GCATAGAGTG TCTGCTATTA ATAACTATGC TCAAAAATTG TGTACCTTTA 
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Fig. 26/7 
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Fig. 27/1 

1 TTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAAT 

61 GGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTT 

121 ATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGAT AAATGCT 

181 TCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCC 

241 CTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTC ACCCAGAAACGCTGGTGAAAGTAAA 

301 AGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGG 

361 T AAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGT 

421 TCTGCTATGTGGCGCGGT ATT ATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCG 

481 CATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTAC 

541 GGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGC 

Pvu I 

601 GGCCAACTT ACTTCTGACAACGATCGGAGGACCGAAGGAGCT AACCGCTTTTTTGC ACAA 

661 CATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACC 

Fsp I 

721 AAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACT ATT 
781 AACTGGCGAACTACTT ACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGA 
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Fig. 27/2 

841 TAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAA 

901 ATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAA 

961 GCCCTCCCGTATCGT AGTTATCT ACACGACGGGGAGTCAGGCAACTATGGATGAACGAAA 

1021 TAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGT 

1081 TT ACTCAT ATAT ACTTT AGATTGATTT AAAACTTCATTTTTAATTT AAAAGGATCT AGGT 

1141 GAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTG 

1201 AGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGT 

1261 AATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCA 

1321 AGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATAC 

1381 TGTCCTTCT AGTGT AGCCGTAGTTAGGCCACCACTTCAAGAACTCTGT AGCACCGCCT AC 

1441 ATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCT 

1501 TACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGC AGCGGTCGGGCTGAACGGG 

1561 GGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCT ACACCGAACTGAGAT ACCTACA 

1621 GCGTGAGCTATGAGAAAGCGCC ACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGT 

1681 AAGCGGC AGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTA 

1741 TCTTTATAGTCCTGTCGGGtTTCGCC ACCTCTGACTTGAGCGTCG ATTTTTGTGATGCTC 

1801 GTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGC 

BspLUllI 

1861 CTTTTGCTGGCCTTTTGCTC ACATGTT CTTTCCTGCGTTATCCCCTGATTCTGTGGATAA 

1921 CCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAG 

1981 CGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGT ATTTTCTCCTT ACGC ATCT 

2041 GTGCGGTATTTCACACCGCATATGGTGC ACTCTCAGT ACAATCTGCTCTGATGCCGCATA 
Bstll07 I 

2101 GTTAAGCC AGTATACA CTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACAC 

2161 CCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGA 

2221 CAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAA 

2281 CGCGCGAGGCAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCC 

2341 CATCCCGCCCCT AACTCCGCCC AGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTT 

Sfi I 

2401 TTTTATTTATGCAGAGGCCG AGGCCGCCTCGGCCT CTGAGCTATTCCAGAAGTAGTGAGG 
Stu I/Avr II 

24 61 AGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTAGCTTACAGCTCAGGGCTGCGATT 
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Fig. 27 /3 

2521 TCGCGCCAAACTTGACGGCAATCCT AGCGTGAAGGCTGGTAGGATTTT ATCCCCGCTGCC 

2581 ATCATGGTTCGACCATTGAACTGCATCGTCGCCGTGTCCCAAAATATGGGGATTGGCAAG 

2641 AACGGAGACCTACCCTGGCCTCCGCTCAGGAACGAGTTCAAGTACTTCCAAAGAATGACC 

2701 ACAACCTCTTCAGTGGAAGGTAAACAGAATCTGGTGATTATGGGTAGGAAAACCTGGTTC 

27 61 TCCATTCCTGAGAAGAATCGACCTTTAAAGGACAGAATTAATATAGTTCTCAGTAGAGAA 

2821 CTCAAAGAACCACCACGAGGAGCTCATTTTCTTGCCAAAAGTTTGGATGATGCCTT AAGA 

2881 CTTATTGAACAACCGGAATTGGCAAGTAAAGTAGACATGGTTTGGATAGTCGGAGGCAGT 

2941 TCTGTTTACCAGGAAGCCATGAATCAACCAGGCCACCTCAGACTCTTTGTGACAAGGATC 

3001 ATGCAGGAATTTGAAAGTGACACGTTTTTCCCAGAAATTGATTTGGGGAAAT ATAAACTT 

3061 CTCCCAGAATACCCAGGCGTCCTCTCTGAGGTCCAGGAGGAAAAAGGCATCAAGTATAAG 

3121 TTTGAAGTCTACGAGAAGAAAGACTAACAGGAAGATGCTTTCAAGTTCTCTGCTCCCCTC 

Bgl II 

3181 rTAAAGTTATGCATTTTTATAAGACCATGGGACTTTTGCTGGCTTT AGATCT TTGTGAAG 

3241 GAACCTTACTTCTGTGGTGTGACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCT 

3301 AAGGT AAATAT AAAATTTTTAAGTGTATAATGTGTT AAACT ACTGATTCTAATTGTTTGT 

3361 GTATTTTAGATTCCAACCTATGGAACTGATGAATGGGAGCAGTGGTGGAATGCCTTTAAT 

3421 GAGGAAAACCTGTTTTGCTCAGAAGAAATGCCATCTAGTGATGATGAGGCTACTGCTGAC 

3481 TCTCAACATTCTACTCCTCCAAAAAAGAAGAGAAAGGTAGAAGACCCCAAGGACTTTCCT 

3541 TCAGAATTGCTAAGTTTTTTGAGTC ATGCTGTGTTTAGTAATAGAACTCTTGCTTGCTTT 

3601 GOT ATTTACACCACAAAGGAAAAAGCTGCACTGCTAT ACAAGAAAATT ATGGAAAAATAT 

3661 TCTGTAACCTTTATAAGTAGGCATAACAGTTATAATC ATAAC AT ACTGTTTTTTCTTACT 

3721 CCAC ACAGGCATAGAGTGTCTGCTATTAATAACT ATGCTCAAAAATTGTGTACCTTTAGC 

3781 TTTTT AATTTGT AAAGGGGTTAATAAGG7UVTATTTGATGTATAGTGCCTTGACT AGAGAT 
BsaB I 

3841 CATAMCAGCC AT ACCAC ATTTGT AGAGGTTTT ACTTGCTTT AAAAAACCTCCC ACACCT 

Mun I 

3901 CCCCCTGAACCTGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTT ATTGCAGC 
3961 TTATAATGGTTACAAATAAAGC AAT AGCATCACAAATTTCACAAAT AAAGCATTTTTTTC 
4021 ACTGCATTCT AGTTGTGGTTTGTCCAAACTCATCAATGTATCTTATCATGTCTGGATCTA 
4081 ATAAAAGATATTTATTTTCATT AGATATGTGTGTTGGTTTTTTGTGTGCAGTGCCTCTAT 
4141 CTGGAGGCCAGGTAGGGCTGGCCTTGGGGGAGGGGGAGGCCAGAATGACTCCAAGAGCTA 
4201 CAGGAAGGCAGGTCAGAGACCCCACTGGACAAAC AGTGGCTGGACTCTGCACC ATAACAC 
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Fig. 27/4 

EcoR I 

4261 ACAATCAACAGGGGAGTGAGCTGGAAATTTGCTAGCGgATTCcagcacactggcggccgt 
Spe I 

4321 tACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTT 



4381 CCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCC 
4441 ATTGACGTCAATAATGACGT ATGTTCCCATAGT AACGCCAATAGGGACTTTCCATTGACG 
4501 TCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACATCAAGTGTATCATAT 

4561 GCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATT ATGCCCA 

SnaB I 

4621 GTAC ATGACCTTATGGGACTTTCCTACTTGGC AGT AC ATC TACGTATTAGTCATCGCTAT 



4681 TACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 

4741 GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAAICA 

4801 ACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCG 

4861 TGTACGGTGGGAGGTCTATATAAGCAGAGCTCGTTTAGTGAACCGTC AGATCGCCTGGAG 

4921 ACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGG 

4981 CCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTA 

5041 TAGAGTCTATAGGCCCACCCCCTTGGCTTCTTATGCATGCTATACTGTTTTTGGCTTGGG 

Bpull02I 

5101 GTCTAT ACACCCCCGCTTCCTCATGTTATAGGTGATGGTATAGCTTAGCCT ATAGGTGTG 

Xcm I 

5161 GGTTATTGACCATTATTGACCACTCCCCTATTGGTGACGATACTTTCCATTACTAATCCA 



5221 TAACATGGCTCTTTGCCACAACTCTCTTTATTGGCTATATGCCAATACACTGTCCTTCAG 

5281 AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCTCATTTATTATTTACAAATTCA 

5341 CATAT ACAACACCACCGTCCCCAGTGCCCGCAGTTTTT ATTAAAC ATAACGTGGGATCTC 

BspE I 

5401 CACGCGAATCTCGGGTACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTC 



5461 TACATCCGAGCCCTGCTCCCATGCCTCCAGCGACTCATGGTCGCTCGGCAGCTCCTTGCT 
5521 OCT AAC AGTGGAGGCCAG ACTTAGGCAC AGC ACGATGCCC ACCACCACCAGTGTGCCGCA 



5581 CAAGGCCGTGGCGGTAGGGTATGTGTCTGAAAATGAGCTCggggagcgggcttgcaccgc 

(Pvu II) 

5641 tgacgcatttggaagacttaaggcagcggcagaagaagatgcaggca^ctgagttgttgt 

5701 gttctgataagagtcagaggtaactcccgttgcggtgctgttaacggtggagggcagtgt 

5761 agtctgagcagtactcgttgctgccgcgcgcgccaccagacataatagctgacagactaa 

Mlu I 

5821 cagactgttcctttccatgggtcttttctgcagtcaccqtccttqac ACGCGT CTCGGGA 
Hind III 

5881 AGCTTGCCGCCACCATGGGATGGAGCTGGGTCTTTCTCTTTCTCCTGTCAGGAACTGCAG 

MGWSWVFLFLLSGTA 
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Fig. 27/5 

(Pvu II) 

5941 GTGTCCTCTCTGAGGTCCAGCTCCAACAGTCTGGACCTGAGCTGGTGAAGCCTGGGGCTT 
GVLSEVQLQQSGPELVKPGA 

5Cba I Dra III 

6001 CAGTAAAGATGTCCTGCAAGACT TCTAGj^T ACACATTCACTGAATACACCAT ACACTGGG 
SV.KMSCKTSRYTFT E Y T I H W 

CDR 1 

6061 TOAGACAGAGCCATGGAAAGAGCCTTGAGTGGATTGGAGGTATTAATCCTAACAATGGTA 

VRQSHGKSLBWIG G I N P N N G 
6121 TTCCT AACTACAACCAGAAGTTCAAGGGC AGGGCCAC ATTGACTGT AGGCAAGTCCTCCA 
IPNYNQKFK G RATLTVGKSS 
CDR 2 

6181 GCACCGCCTACATGGAGCTCCGCAGCCTGACATCTGAGGATTCTGCGGTCT ATTTCTGTG 
STAYMELRSLTSEDSAVYFC 

6241 CAAGAAGAAGAATCGCCTATGGTTACGACGAGGGCCATGCTATGGACTACTGGGGTCAAG 
A R RRIAYGYDEGHAMDY W G Q 
CDR 3 BamH I 

6301 GAACCTCAGTCACCGTCTCCTCAGGTGAGT GGATCCT CTGCGCCTGGGCCCAGCTCTGTC 

GTSVTVSS 

6361 CCACACCGCGGTCACATGGCACCACCTCTCTTGCAGCCTCCACCAAGGGCCCATCGGTCT 

S T K G P S V 

6421 TCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGG 
FPLAPSSKSTSGGTAALGCL 

Age I 

6481 TCAAGGACTACTTCCCCGA ACCGGTG ACGGTGTCGTG6AACTCAGGCGCCCTGACCAGCG 
VKDYFPEPVTVSWNSGALTS 

6541 GCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGACTCT ACTCCCTC AGCAGCGTGG 
GVHTFPAVLQSSGLYSLSSV 
BstE II 

6601 TGACCG TGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGC 
VTVPSSSLGTQTYICNVNHK 

6661 CCAGCAACACCAAGGTGGACAAGAAAGTTGAGCCCAAATCTTGTGACAAAACTCACACAT 
PSNTKVDKKVEPKSCDKTHT 

6721 GCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAA 
CPPCPAPELLGGPSVFLFPP 

6781 AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 
KPKDTLMISRT PEVTCVVVD 

6841 TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 
VSHEDPEVKFNWYVDGVEVH 

6901 ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGGGTGGTCAGCGTCC 
NAKTKPREEQYNSTYRVVSV 

6961 TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 
LTVLHQDWLNGKEYKCKVSN 
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Fig. 27 /6 

7021 AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCC AAAGCC AAAGGGCAGCCCCGAGAAC 
KALPAPIEKTISKAKGQPRE 

7081 CACAGGTGTACACCCTGCCCCCATCCCGGGAGGAGATGACC AAGAACCAGGTCAGCCTGA 
PQVYTLPPSREEMTKNQVSL 

7141 CCTGCCTGGTCAAAGGCTTCT ATCCCAGCG AC ATCGCCGTGGAGTGGGAGAGCAATGGGC 
TCLVKGFYPSDIAVBWESNG 

7201 AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 
QPENNYKTTPPVLDSDGSFF 

7261 TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 
LYSKLTVDKS RWQQGNVFSC 

7321 CCGTGATGCATGAGGCTCTGC AC AACCACT ACACGCAGAAGAGCCTCTCCCTGTCTCCGG 
SVMHEALHNHYTQKSLSLSP 
NgoM I 

7381 GTAAATGAGTGCGACGGCCGGCAAGCCCCGCTCCCCGGGCTCTCGCGGTCGCACGAGGAT 
G K * 

7441 GCTTGGCACGTACCCCCTGTACATACTTCCCGGGCGCCCAGCATGGAAATAAAGCACCGG 
7501 ATCTAATAAAAGATATTTATTTTCATTAGATATGTGTGTTGGTTTTTTGTGTGCAGTGCC 
7561 TCTATCTGGAGGCCAGGTAGGGCTGGCCTTGGGGGAGGGGGAGGCCAGAATGACTCCAAG 
7621 AGCTACAGGAAGGCAGGTCAGAGACCCCACTGGACAAACAGTGGCTGGACTCTGCACCAT 
7681 AACACACAATCAACAGGGGAGTGAGCTGGaaatttgctagcgaattaattc 7731 
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Fig. 28: 
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Fig. 29/1 
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Fig. 30 /I 

Spe I 

1 gaattccagc acac^gcgg ccot tACTAG T TATTAATAG TAATCAATTA 

51 CGGGGTCATT AGTTCATAGC CCATATATGG AGTTCCGCGT TACATAACTT 

101 ACGGTAAATG GCGCGCCTGG CTGACCGCCC AACGACCCCC GCCCATTGAC 

151 GTCAATAATG ACGTATGTTC CCATAGTAAC GCCAATAGGG ACTTTCCATT 

201 GACGTCAATG GGTGGAGTAT TTACGGTAAA CTGCCCACTT GGCAGTACAT 

251 CAAGTGTATC ATATGCCAAG TACGCCCCCT ATTGACGTCA ATGACGGTAA 

301 ATGGCCCGCC TGGCATTATG CCCAGTACAT GACCTTATGG GACTTTCCTA 

SnaB I 

351 CTTGGCAGTA CAT CTACGTA TTAGTCATCG CTATTACCAT GGTGATGCGG 

401 TTTTGGCAGT ACATCAATGG GCGTGGATAG CGGTTTGACT CACGGGGATT 

451 TCCAAGTCTC CACCCCATTG ACGTCAATGG GAGTTTGTTT TGGCACCAAA 

501 ATCAACGGGA CTTTCCAAAA TGTCGTAACA ACTCCGCCCC ATTGACGCAA 

551 ATG6GCGGTA GGCGTGTACG GTGGGAGGTC TATATAAGCA GAGCTCGTTT 

601 AGTGAACCGT CAGATCGCCT GGAGACGCCA TCCACGCTGT TTTGACCTCC 

Sac II 

651 ATAGAAGACA CCGGGACCGA TCCAGCCTCCgCGGCCGGGA ACGGTGCATT 

701 GGAACGCGGA TTCCCCGTGC CAAGAGTGAC GTAAGTACCG CCTATAGAGT 

751 CTATAGGCCC ACCCCCTTGG CTTCTTATGC ATGCTATACT GTTTTTGGCT 

801 TGGGGTCTAT ACACCCCCGC TTCCTGATGT TATAGGTGAT GGTATAGCTT 

851 AGCCTATAGG TGTGGGTTAT TGACCATTAT TGACCACTCC CCTATTGGTG 

901 ACGATACTTT CCATTACTAA TCCATAACAT GGCTCTTTGC CACAACTCTC 

951 TTTATTGGCT ATATGCCAAT ACACTGTCCT TCAGAGACTG ACACGGACTC 

1001 TGTATrrTTA CAGGATGGGG TCTCATTTAT TATTTACAAA TTCACATATA 

1051 CAACACCACC GTCCCGAGTG CCCGCAGTTT TTATTAAACA TAACGTGGGA 

(BspE I) 

1101 TCTCCACGCG AATCTCGGGT ACGTGTIgCafiACATGGGCT CTTCTCCGGT 
1151 AGCGGCGGAG CTTCTACATC CGAGCCCTGC TCCCATGCCT CCAGCGACTC 
1201 ATGGTCGCTC GGCAGCTCCT TGCTCCTAAC AGTGGAGGCC AGACTTAGGC 
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Fig. 30/2 

1251 ACAGCACGAT GCCCACCACC ACCAGTGTGC CGCACAAGGC CGTGGCGGTA 

1301 GGGTATGTGT CTGAAAATGA GCTCggggag cgggcttgca ccgctgacgc 
Aflll 

1351 atttggaaga cttaaggcag cggcagaaga agatgcaggc agctgagttg 

1401 ttgtgttc^ ataagagtca gaggtaactc ccgt^cggt gctgttaacg 

1451 gtggagggca gtgtagtctg agcagtactc gttgctgccg cgcgcgccac 

1501 cagacataat agctgacaga daacagact gttcctltcc atgggtcttt 

Mlu I Hind III 

1551 tctgcagtca ccgtecttga cacacato tc gq gaaacttG CCGCCACCAT 

M 

1601 GGAGACAGAC AGACTCCTGC TATGGGTGCT GCTGCTCTGG GTTCCAGGTT 
ET DT L L LWV L LLW VPG 
(BspE I) 

1651 C CTCCGGAG A CATTGTGATG ACXX^AATCTC CAGACTCTTT GGCTGTGTCT 
SSGDIVMTQSPDSL AVS 

1701 CTAGGGGAGA GGGCCACCAT CAACTGCAAG TCCAGTCAGA GCCTTTTATA 
LGE RATI NC K S S O S L L Y 

Xbai CDR 1 

1751 TTCTAGAA AT CAAAAGAACT ACTTGGCCTG GTATCAGCAG AAACXJAGGAC 

S R N O K N Y LA W Y Q Q K P G 

Konl 

1801 AGCCACCCAA ACTCCTCATC TTTTGGGCTA GCACTAGGGA ATCTGG GGTA 
Q P P K L L I F W A S T R E S G V 

CDR 2 

1851 CgTGATAGGTTCAGTGGCAG TGGGTTTGGG ACAGACTTCA CCCTCACCAT 
PDR FSGS GFG TOF TLTI 

1901 TAGCAGCCTG CAGGCTGAAG ATGTGGCAGT TTATTACTGT CAGCAATATT 

SSL QAE DVAV YYC O O Y 
1951 TTAGCTATCC GCTCACGTTC GGACAAGGGA CCAA6GTGGA AATAA AACGT 
F S Y P L T F GQG TKVE IKR 

CDR 3 
BamHI 

2001 GAGTaoatee ATCTGGGATA AGCATGCTGT TTTCTGTCTG TCCCTAACAT 

2051 GCCCTGTGAT TATGCGCAAA CAACACACCC AAGGGCAGAA CTTTGTTACT 

2101 TAAACACCAT CCTGTTIGCT TCTTTCC TCA GGA ACTGTGG CTGCACCATC 

TV A A PS 

2151 TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT GGAACTGCCT 

VFI FPP SDEQ LKS GTA 
2201 CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC CAAAGTACAG 

SVVC LLN NFY PREA KVQ 
2251 TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG AGAGTGTCAC 

WKV DNAL QSG NSQ ESVT 
2301 AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC ACCCTGACGC 
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Hg.30/3 

EQD SKD STYS LSSTLT 
2351 TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG CGAAGTCACC 

LSKA DYE KHKVYAC EVT 
2401 CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA GGGGAGAGTG 

HQG LSSP VTK SFN RGEC 
2451 TTAGAGGGAG AAGTGCCCCC ACCTGCTCCT CAGTTCCAGC CTGACCCCCT 

PspS II 

2501 CCCATCCTTT 6GCCTCTGAC CCTTTTTCCA CAG GGGACCT ACCCCTATTG 

2551 CGGTCCTCXSA GCTCATCTTT CACCTCACCC CCCTCCTCCT CCTTGGCTTT 

2601 AATTATGCTA ATGTTGGAGG AGAATGAATA AATAAAGTGA ATCTTTGCAC 

2651 CTGTGGTGGA TCTAATAAAA GATATTTATT TTCATTAGAT ATGTGTGTTG 

2701 GTTTTTTGTG TGCAGTGCCT CTATCTGGAG GCCAGGTAGG GCTGGCCTTG 

2751 GGGGAGGGGG AGGCCAGAAT GACTCCAAGA GCTACAGGAA GGCAGGTCAG 

2801 AGACCCCACT GGACAAACAG TGGCTGGACT CTGCACCATA ACACACAATC 

2851 AACAGGGGAG TGAGCTGGAA ATTTGCTAGC GAATTCTTGA AGACGAAAGG 

2901 GCCTCGTGAT ACGCCTATTT TTATAGGTTA ATGTCATGAT AATAATGGTT 

2951 TCTTAGACGT CAGGTGGCAC TTTTCGGGGA AATGTGCGCG GAACCCCTAT 

3001 TTGTTTATTT TTCTAAATAC ATTCAAATAT GTATCCGCTC ATGAGACAAT 

3051 AACCCTGATA AATGCTTCAA TAATATTGAA AAAGGAAGAG TATGAGTATT 

3101 CAACATTTCC GTGTCGCCCT TATTCCCTTT TTTGCGGCAT TTTGCCTTCC 

3151 TGTTTTTGCT CACCCAGAAA C6CTGGTGAA AGTAAAAGAT GCTGAAGATC 

3201 AGTTGGGTGC ACGAGTGGGT TACATCGAAC TGGATCTCAA CAGCGGTAAG 

3251 ATCCTTGAGA GTTTTCGCCC CGAAGAACGT TTTCCAATGA TGAGCACTTT 

3301 TAAAGTTCTG CTATGTGGCG CGGTATTATC CCGTGTTGAC GCCGGGCAAG 

3351 AGCAACTCGG TCGCCGCATA CACTATTCTC AGAATGACTT GGTTGAGTAC 

3401 TCACCAGTCA CAGAAAAGCA TCTTACGGAT GGCATGACAQ TAAGAGAATT 

3451 ATGCAGTGCT GCCATAACCA TGAGTGATAA CACTGCGGCC AACTTACTTC 
Pvu I 

3501 TGACAA CGAT CGG AGGACCG AAGGAGCTAA CCGCTTTTTT GCACAACATG 
3551 GGGGATCATG TAACTCGCCT TGATCGTTQG GAACCGGAGC TGAATGAAGC 



126 



EP0 953 639A1 



Fig. 30/4 

3601 CATACCAAAC GACGAGCGTG ACACCACGAT GCCTGCAGCA ATGGCAACAA 

3651 CGTTGCGCAA ACTATTAACT GGCGAACTAC TTACTCTAGC TTCCCGGCAA 

3701 CAATTAATAG ACTGGATGGA GGCGGATAAA GTTGCAGGAC CACTTCTGCG 

3751 CTCGGCCCTT CCGGCTGGCT GGTTTATTGC TGATAAATCT GGAGCCGGTG 

3801 AGCGTGGGTC TCGCX3GTATC ATTGCAGCAC TGGGGCCAGA TGGTAAGCCC 

3851 TCCCGTATCG TAGTTATCTA CACGACGGGG AGTCAGGCAA CTATGGATGA 

3901 ACGAAATAGA CAGATCGCTG AGATAGGTGC CTCACTGATT AAGCATTGGT 

3951 AACTGTCAGA CCAAGTTTAC TCATATATAC TTTAGATTGA TTTAAAACTT 

4001 CATTTTTAAT TTAAAAGGAT CTAGGTGAAG ATCCI I I I IG ATAATCTCAT 

4051 GACCAAAATC CCTTAACGTG AGTnTCGTT CCACTGAGCG TCAGACCCCG 

4101 TAGAAAAGAT CAAAGGATCT TCTTGAGATC CI I M I MC T GCGCGTAATC 

4151 TGCTGCTTGC AAACAAAAAA ACCACCGCTA CCAGCGGTGG TTTGTTTGCC 

4201 GGATCAAGAG CTACCAACTC I I 1 1 ICCGAA GGTAACTGGC TTCAGCAGAG 

4251 CGCAGATACC AAATACTGTC CTTCTAGT6T AGCXJGTAGTT AGGCCACCAC 

4301 TTCAAGAACT CTGTAGCACC GCCTACATAC CTCGCTCTGC TAATCCTGTT 

4351 ACCAGT6GCT GCTGCCAGTG GCGATAAGTC GTGTCTTACC GGGTTGGACT 

4401 CAAGACGATA GTTACCGGAT AAGGCGCAGC GGTCGGGCTG AACGGGGGGT 

4451 TCGTGCACAC AGCCCAGCTT GGAGCGAACX3 ACCTACACCX3 AACTGAGATA 

4501 CCTACAGCGT GAGCTATGAG AAAGCGCCAC GCTTCCCGAA G6GAGAAAGG 

4551 CGGACAGGTA TCXGGTAAGC GGCAG6GTCG GAACAGGAGA GCGCACGAGG 

4601 GAGCTTCCAG GGGGAAACGC CTG6TATCTT TATAGTCCTG TCGGGTTTCG 

4651 CCACCTCTGA CTTGAGCGTC GAI 1 1 ilGTG ATGCTCGTCA GG6GGGCGGA 

4701 GCCTATGGAA AAACGCCAGC AACGCGGCCT TTTTACGGTT CCTGGCCTTT 

BspLUm 

4751 TGCTGGCCTT TTGCTC ACAT GT TCTTTCCT GCGTTATCCC CTGATTCTGT 
4801 GGATAACCGT ATTACCGCCT TTGAGTGAGC TGATACCGCT CGCCGCAGCC 



127 



EP0953639A1 



Fig. 30/5 

4851 GAACGACCGA GCGCAGCGAG TCAGTGAGCG AGGAAGCGGA AGAGCGCCTG 

4901 ATGCGGTATT TTCTCCTTAC GCATCTGTGC GGTATTTCAC ACCGCATATG 

BstHOTI 

4951 GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGIflJift 
5001 CACTCCGCTA TCGCTACGTG ACTGGGTCAT GGCTGCGCCC CX3ACACCCGC 
5051 CAACACCCGC TGACGCGCCC TGACGGGCTT GTCTGCTCCC GGCATCCGCT 
5101 TACAGACAAG CTGTGACCGT CTCCGGGAGC TGCATGTGTC AGAGGTTTTC 
5151 ACCGTCATCA CCGAAAC6CG CGAGGCAGCT GTGGAATGTG TGTCAGTTAG 
5201 GGTGTGGAAA GTCCCCAGGC TCCCCAGCAG GCAGAAGTAT GCAAAGCATG 
5251 CATCTCAATT AGTCAGCAAC CAGGCTCCCC AGCAGGCAGA AGTATGCAAA 
5301 GCATGCATCT CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC 
5351 ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC CCCATGGCTG 

srii 

5401 ArTAATTTTT TTTAnTATft rAftAftftrrfiA QGCCGCCTCQ GCCTCTGAGC 

Stu l/Avr II 

5451 TATTCCA6AA GTAGTGAGGA GGCI I I I I IG G AGGCCTAGG CTTTTGCAAA 

5501 AAGCTAGCTT CACGCTGCCG CAAGCACTCA GGGCGCAAGG GCTGCTAAAG 

5551 GAAGCGGAAC ACGTAGAAAG CCAGTCCGCA GAAACGGTGC TGACCCCGGA 

5601 TGAATGTCAG CTACTGGGCT ATCTGGACAA GGGAAAACGC AAGCGCAAAG 

5651 AGAAAGCAGG TAGCTTGCAG TGGGCTTACA TGGCGATAGC TAGACTGGGC 

5701 GGTTTTAT6G ACAGCAAGCG AACCGGAATT GCCA6CTGGG GCGCCCTCTG 

5751 GTAAGGTTGG GAAGCCCTGC AAAGTAAACT GGATGGCTTT CTT6CCGCCA 

Bgl ll/Bcl I 

5801 AGGATCTGAT GGCGCAGGGG ATC AAGATCT GATCAA GAGA CAGGATGAGG 

5851 ATCGTTTCGC ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG 

5901 CTTGGGTGGA GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC 

5951 TGCTCTGATG CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT 

6001 TTTTGTCAAG ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG 

Msci 

6051 CAGCGCGGCT ATCQTGGC TQ GCCAC GACGG GCGTTCCTTG CGCAGCTGTG 
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Fig. 30/6 

6101 CTCGACGTTG TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT 

6151 GCCGGGGCAG GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT 

6201 CCATCATGGC TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC 

6251 TGCCCATTCG ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG 

6301 GATGGAAGCC GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG 

6351 GGCTCGCGCC AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC 

6401 GGCGAGGATC TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT 

6451 GGTGGAAAAT GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG 
Rsr 11 

6501 TGG CGGACCG CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA 

6651 GAGCTTGGCQ GCGAATGGGC T6ACCGCTTC CTCGTGCTTT ACGGTATCGC 

6601 CGCTCCCGAT TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT 

NspV 

6651 TCTGAGCGGG ACTCTGGGG T TCGAAA TGAC C6ACCAAGCG ACGCCCAACC 

6701 TGCCATCACG AGATTTCGAT TCCACCGCCG CCTTCTATGA AAGGTTGGGC 

6751 TTCGGAATCG TTTTCCGGGA CGCCG6CTGG ATGATCCTCC AGCGCGG6GA 

Sma I Nru I 

6801 TCTCATGCTG GAGTTCTTCG CCCA CCCCGG GC TCGATCCC C TCGCGA GTT 

6851 GGTTCAGCTG CTGCCTGAGG CTGGACGACC TCGCGGAGTT CTACCGGCAG 

6901 TGCAAATCCGTCX3GCATCCAGGAAACCAGCAGCGGCTATCCGCGCATCCA 

6951 TGCCCCCGAA CTGCAGGAGT GGGGAGGCAC GATGGCCGCT TTGGTCCCGG 

7001 ATCTTTGTGA AGGAACCTTA CTTCTGTGGT GTGACATAAT TGGACAAACT 

7051 ACCTACAGAG ATTTAAAGCT CTAAGGTAAA TATAAAATTT TTAAGTGTAT 

7101 AATGTGTTAA ACTACTGATT CTAATTGTTT GTGTATTTTA GATTCCAACC 

7151 TATGGAACTG ATGAATGGGA GCAGTGGTGG AATGCCTTTA ATGAGGAAAA 

7201 CCTGTTTTGC TCAGAAGAAA TGCCATCTAG TGATGATGAG GCTACTGCTG 

7251 ACTCTCAACA TTCTACTCCT CCAAAAAAGA AGAGAAAGGT AGAAGACCCC 

7301 AAGGACTTTC CTTCAGAATT GCTAAGTTTT TTGAGTCATG CTGTGTTTAG 
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Fig. 30/7 

7351 TAATAGAACT CTTGCTTGCT TTGCTATTTA CACCACAAAG GAAAAAGCTG 

7401 CACTGCTATA CAAGAAAA7T ATGGAAAAAT ATTCTGTAAC CTTTATAAGT 

7451 AGGCATAACA GTTATAATCA TAACATACTG TTTTTTCTTA CTCCACACAG 

7501 GCATAGAGTG TCTGCTATTA ATAACTATGC TCAAAAATTG TGTACCTTTA 

7551 GCTTTTTAAT TTGTAAAGGG GTTAATAAGG AATATTTGAT GTATAGTGCC 

7601 TTGACTAGAG ATCATAATCA GCCATACCAC ATTTGTAGAG GTTTTACTTG 

7651 CTTTAAAAAA CCTCCCACAC CTCCCCCTGA ACCTGAAACA TAAAATGAAT 
Mun I 

7701 GCAATTGT TG TTGTTAACTT GTTTATTGCA GCTTATAATG GTTACAAATA 
7751 AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT TCACTGCATT 
7801 CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA TGTCTGGATC 
7851 TAATAAAAGA TATTTATTTT CATTAGATAT GTGTGTTGGT TTTTTGTGTG 
7901 CAGTGCCTCT ATCTGGAGGC CAGGTAGGGC TGGCCTTGGG GGAGGGGGAG 
7951 GCCAGAATGA CTCCAAGAGC TACAGGAAG6 CAGGTCAGAG ACCCCACTGG 
8001 ACAAACAGTG GCTGGACTCT GCACCATAAC ACACAATCAA CAGGGGAGTG 
8051 AGCTGGAAATTTGCTAGC 
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Fig. 32/1 
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Fig. 32/2 
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Fig. 33/1 



1 TTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAAT 
61 GGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTT 
121 ATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCT 
181 TCAATAATATTGAAAAAGGAAGAGT ATGAGTATTCAACATTTCCGTGTCGCCCTTATTCC 
241 CTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAA 
30 1 AGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGG 
361 T AAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGT 
421 TCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCG 
481 CATACACTATTCrCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTAC 
541 GGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGC 

Pvu I 

601 GGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAA 



661 CATGGGGGATCATGT AACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACC 

Fsp I 

721 AAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTCCGCAAACTATT 



781 AACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGA 
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Hg.33/2 

841 TAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAA 

901 ATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAA 

961 GCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAA 

1021 TAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGT 

1081 TTACTC ATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTT AAAAGGATCTAGGT 

1141 GAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTT AACGTGAGTTTTCGTTCCACTG 

1201 AGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGT 

1261 AATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCA 

1321 AGAGCTACCAACTCTTTTTCCGAAGGT AACTGGCTTCAGCAGAGCGCAGATACCAAATAC 

1381 TGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTAC 

1441 ATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCT 

1501 TACCGGGTTGGACTCAAGACGAT AGTT ACCGGAT AAGGCGC AGCGGTCGGGCTGAACGGG 

1561 GGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCT AC ACCGAACTGAGAT ACCT ACA 

1621 GCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGT 

1681 AAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTA 

1741 TCTTTATAGTCCTGTCGGGTTTCGCC ACCTCTGACTTGAGCGTCGATTTTTGTGATGCTC 

1801 GTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGC 

BspLUllI 

1861 CTTTTGCTGGCCTTTTGCT CACATGTT CTTTCCTGCGTTATCCCCTGATTCTGTGGATAA 

1 92 1 CCGT ATT ACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGC AG 

1 98 1 CGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCT 

204 1 GTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATA 
BstllO? I 

2101 GTTAAGCC AGTATACA CTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACAC 
2161 CCGCC AACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTAC AGA 
2221 CAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAA 
2281 CGCGCGAGGCAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACTCCGCC 

2341 CATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACTAATTTT 

Sfi I 

2401 TTTTATTTATGCAGAGGCCG AGGCCGCCTCGGCC TCTGAGCTATTCCAGAAGTAGTGAGG 
Stu I/Avr II 

2461 AGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTAGCTTACAGCTCAGGGCTGCGATT 
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Fig. 33/2 

2521 TCGCGCCAAACTTGACGGCAATCCTAGCGTGAAGGCTGGTAGGATTTTATCCCCGCTGCC 

2581 ATCATGGTTCGACCATTGAACTGCATCGTCGCCGTGTCCCAAAATATGGGGATTGGCAAG 

2641 AACGGAGACCTACCCTGGCCTCCGCTCAGGAACGAGTTCAAGT ACTTCCAAAGAATGACC 

2701 ACAACCTCTTCAGTGGAAGGTAAACAGAATCTGGTGATTATGGGT AGGAAAACCTGGTTC 

2761 TCCATTCCTGAGAAGAATCGACCTTTAAAGGACAGAATTAATATAGTTCTCAGTAGAGAA 

2821 CTCAAAGAACCACCACGAGGAGCTCATTTTCTTGCCAAAAGTTTGGATGATGCCTTAAGA 

2881 CTTATTGAACAACCGGAATTGGCAAGTAAAGTAGACATGGTTTGGATAGTCGGAGGCAGT 

2941 TCTGTTTACCAGGAAGCC ATGAATCAACCAGGCCACCTCAGACTCTTTGTGACAAGGATC 

3001 ATGCAGGAATTTGAAAGTGAC ACGTTTTTCCC AGAAATTGATTTGGGG AAAT AT AAACTT 

3061 CTCCCAGAATACCCAGGCGTCCTCTCTGAGGTCCAGGAGGAAAAAGGCATCAAGTATAAG 

3121 TTTGAAGTCTACGAGAAGAAAGACT AACAGGAAGATGCTTTCAAGTTCTCTGCTCCCCTC 

Bgl II 

3181 CTAAAGCTATGCATTTTTATAAGACCATGGGACTTTTGCTGGCTT TAGATCT TTGTGAAG 

3241 GAACCTTACTTCTGTGGTGTGACATAATTGGACAAACT ACCTACAGAGATTTAAAGCTCT 

3301 AAGGTAAATATAAAATTTTTAAGTGTATAATGTGTTAAACTACTGATTCTAATTGTTTGT 

3361 GTATTTTAGATTCCAACCT ATGGAACTGATGAATGGGAGCAGTGGTGGAATGCCTTTAAT 

3421 GAGGAAAACCTGTTTTGCTCAGAAGAAATGCCATCTAGTGATGATGAGGCTACTGCTGAC 

3481 TCTCAACATTCTACTCCTCCAAAAAAGAAGAGAAAGGTAGAAGACCCCAAGGACTTTCCT 

3541 TCAGAATTGCTAAGTTTTTTGAGTCATGCTGTGTTTAGTAATAGAACTCTTGCTTGCTTT 

3601 GCTATTTACACCACAAAGGAAAAAGCTGCACTGCTATACAAGATUU^TTATGGAAAAATAT 

3661 TCTGTAACCTTTATAAGTAGGCATAAC AGTT ATAATCATAACATACTGTTTTTTCTTACT 

3721 CCACACAGGCATAGAGTGTCTGCTATT AATAACTATGCTCAAAAATTGTGTACCTTTAGC 

3781 TTTTTAATTTGTAAAGGGGTT AATAAGGAATATTTGATGTAT AGTGCCTTGACTAGAGAT 
BsaB I 

3841 CATAATCA GCCATACCACATTTGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCT 

Mun I 

3901 CCCCCTGAACCTGAAACAT AAAATGAATGCAATTCTTGTTGTTAACTTGTTT ATTGC AGC 
3961 TTATAATGGTT AC AAATAAAGCAAT AGCATCAC AAATTTC ACAAAT AAAGCATTTTTTTC 
4021 ACTGC ATTCT AGTTGTGGTTTGTCCAAACTC ATC AATGT ATCTT ATCATGTCTGGATCT A 
4081 AT AAAAGAT ATTTATTTTCATT AG ATATGTGTGTTGGTTTTTTGTGTGCAGTGCCTCTAT 
4141 CTGGAGGCCAGGTAGGGCTGGCCTTGGGGGAGGGGGAGGCCAGAATGACTCCAAGAGCTA 



135 



EP0 953 639A1 



Fig. 33/3 

4201 CAGGAAGGCAGGTCAGAGACCCCACTGGACAAACAGTGGCTGGACTCTGCACCATAACAC 

EcoR I 

4261 ACAATCAACAGGGGAGTGAGCTGGAAATTTGCTAGC GAATTC caqcacactggcggccgt 
(Spe I) 

4321 tACTAGTTATTAATAGTAATCAATTACGGGGTCATTAGTTCATAGCCCATATATGGAGTT 



4381 CCGCGTTACATAACTTACGGTAAATGGCCCGCCTGGCTGACCGCCCAACGACCCCCGCCC 

4441 ATTGACGTCAAT AATGACGT ATGTTCCCAT AGTAACGCCAATAGGGACTTTCCATTGACG 

4501 TCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGT ACATCAAGTGTATCAT AT 

4561 GCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGCATTATGCCCA 

SnaB I 

4621 GTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATC tACQTAT TAGTCATCGCTAT 

4681 TACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 

4741 GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCA 

4 801 ACGGGACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCG 

4 861 TGTACGGTGGGAGGTCT AT AT AAGCAGAGCTCGTTT AGTGAACCGTCAGATCGCCTGGAG 

4921 ACGCCATCCACGCTGTTTTGACCTCCATAGAAGACACCGGGACCGATCCAGCCTCCGCGG 

4981 CCGGGAACGGTGCATTGGAACGCGGATTCCCCGTGCCAAGAGTGACGTAAGTACCGCCTA 

5041 TAGAGTCTATAGGCCCACCCCCTTGGCTTCTTATGCATGCTATACTGTTTTTGGCTTGGG 

Bpull02I 

5101 GTCTATACACCCCCGCTTCCTCATGTTATAGGTGATGGTAT AGCTTAQCC TATAGGTGTG 

Xcm I 

5161 GGTTATTGACCATTATTGACCACTCCCCTATTGGTGACGATACTTTCCATTACTAATCCA 



5221 T AACATGGCTCTTTGCC ACAACTCTCTTT ATTGGCTAT ATGCCAAT ACACTGTCCTTC AG 

5281 AGACTGACACGGACTCTGTATTTTTACAGGATGGGGTCTCATTTATTATTTACAAATTCA 

5341 CATATACAACACCACCGTCCCCAGTGCCCGCAGTTTTTATTAAACATAACGTGGGATCTC 

BspE I 

5401 CACGCGAATCTCGGGT ACGTGTTCCGGACATGGGCTCTTCTCCGGTAGCGGCGGAGCTTC 



5461 T ACATCCGAGCCCTGCTCCCATGCCTCCAGCGACTCATGGTCGCTCGGCAGCTCCTTGCT 

5521 CCTAACAGTGGAGGCCAGACTTAGGCACAGCACGATGCCCACCACCACCAGTGTGCCGCA 

5581 CAAGGCCGTGGCGGT AGGGTATGTGTCTGAAAATGAGCTCggggag egg gc t tg caccgc 

(Pvu II) 

5641 tgacgcatttggaagacttaaggcagcggcagaagaagatgcaggcagctgagttgttgt 

5701 gttctgataagagtcagaggtaactcccgttgcggtgctgttaacggtggagggcagtgt 

57 61 agtctgagcagtactcgttgctgccgcgcgcgccaccagacataatagctgacagactaa 

Mlu I 

5821 cagactgttcctttccatgggtcttttctgcagtcaccgtccttgacACGCGTCTCGGC^ 
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Fig. 33/4 

Hind III 

5881 AGCTT GCCGCCACOVTGGACTGGACCTGGCGCGTGTTTTGCCTGCTCGCCGTGGCTCCTG 

MDWTWRVFCLLAVAP 

5941 GGGCCCACAGCCAGGTGCAACTGGTGCAGTCCGGCGCCGAAGTGAAGAAACCCGGTGCTT 
GAHSQVQLVQSGAEVKKPGA 
(Pvu II) (Spe I) 

6001 CCGTGAAAGT CAGCTGT AAA ACTAGTA GATACACCTTCACTGAATACACCATACACTGGG 
SVKVSCKTSRYTFT E Y T I H W 
Msg I CDR 1 

6061 TTAGACAGGCCCCTCGCCAAAGGCTGGAGTGGATAGGAGGTATTAATCCTAACAATGGTA 
VRQAPGQRLEWIG G I N P N N G 

6121 TTCCTAACTACAACCAGAAGTTCAAGGGCCGGGCCACCTTGACCGTAGGCAAGTCTGCCA 
IPNYNQKFKG RATLTVGKSA 
CDR 2 

6181 GCACCGCCTACATGGAACTGTCCAGCCTGCGCTCCGAGGACACTGCAGTCTACTACTGCG 
STAYMELSSLRSEDTAVYYC 

6241 CCAGAAGAAGAATCGCCTATGGTTACGACGAGGGCCATGCTATGGACTACTGGGGTCAAG 
A R RRIAYGYDEGHAMD Y W G Q 
CDR 3 BamH I 

6301 GAACCCTTGTCACCGTCTCCTCAGGTGAGT SGATCC TCTGCGCCTGGGCCCAGCTCTGTC 
GTLVTVSS 

6361 CCACACCGCGGTCACATGGCACCACCTCTCTTGCAGCCTCCACCAAGGGCCCATCGGTCT 

S T K G P S V 

6421 TCCCCCTGGCACCCTCCTCCAAGAGCACCTCTGGGGGCACAGCGGCCCTGGGCTGCCTGG 
FPLAPSSKSTSGGTAALGCL 

Age I 

6481 TCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCG 
VKDYFPEPVTVSWNSGALTS 

6541 GCGTGCACACCTTCCCGGCTGTCCTACAGTCCTCAGGACTCTACTCCCTCAGCAGCGTGG 
GVHT F PAVLQ S S GLY S L S SV 
BstE II 

6601 TGACC GTGCCCTCCAGCAGCTTGGGCACCCAGACCTACATCTGCAACGTGAATCACAAGC 
VTVPSSSLGTQTYICNVNHK 

6661 CCAGCAACACCAAGGTGGACAAGAAAGTTGAGCCCAAATCTTGTGACAAAACTCACACAT 
PSNTKVDKKVEPKSCDKTHT 

6721 GCCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAA 
CPPCPAPELLGGPSVFLFPP 

6781 AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 
KPKDTLMISRT PEVTCVVVD 

6841 TGAGCC ACGAAGACCCTGAGGTC AAGTTCAACTGGT ACGTGGACGGCGTGGAGGTGCATA 
VSHED .PEVKFNWYVDGVEVH 

6901 ATGCCAAGACAAAGCCGCGGGAGGAGC AGT ACAACAGCACGT ACCGGGTGGTC AGCGTCC 
NAKTKPREEQYNSTY.RVV SV 
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Fig. 33/5 

6961 TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 
LTVLHQDWLNGKEYKCKVSN 

7021 AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 
KALPAPIEKTISKAKGQPRE 

7081 CACAGGTGTACACCCTGCCCCCATCCCGGGAGGAGATGACCAAGAACCAGGTC AGCCTGA 
PQVYTLPPSREEMTKNQVSL 

7141 CCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGC 
TCLVKGFY. PSDIAVEWBSNG 

7201 AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 
QPENNYKTTPPVLDSDGSFF 



7261 TCTACAGCT^GCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 
LYSKLTVDKSRWQQGNVFSC 

7321 CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 
SVMHEALHNHYTQKSLSLSP 
NgoM I 

7381 GTAAATGAGTGCGACGGOCGGCAAGCCCCGCTCCCCGGGCTCTCGCGGTCGCACGAGGAT 
G K * 

7441 GCTTGGCACGTACCCCCTGTACATACTTCCCGGGCGCCCAGCATGGAAATAAAGCACCGG 
7501 ATCTAATAAAAGATATTTATTTTC ATTAGATATGTGTGTTGGTTTTTTGTGTGCAGTGCC 
7561 TCTATCTGGAGGCCAGGTAGGGCTGGCCTTGGGGGAGGGGGAGGCCAGAATGACTCCAAG 
7621 AGCTACAGGAAGGCAGGTCAGAGACCCCACTGGACAAACAGTGGCTGGACTCTGCACCAT 
7681 AACACACAATCAACAGGGGAGTGAGCTGGaaatttgctagcgaattaattc 7731 



Fig. 34 A 

INTRON 

3' end V gene 5* end of CHI 

ACC GTC TCC TC4 G::GTGAGTGQfia£C^4i'CCTCT(nTGCAG::CC' 
TVS S spUce donor site BamHI splice acceptor site 

-TCC ACC AAGGGC 
S T K G U 

ACC GTC TCC TCA G::::CC TCC ACC AAG GGC 
TVSS STKG 

ACC GTC TCC TCA GCC TCC ACC AAG GGC 
TVSSASTKG 
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Fig. 34 B 

INTRON 

3' end V gene 5* end Kappa constant 

GAA ATA AA^ C::GTGAGTQQMCQ-(^iQi-CTTCTTTCCTCAG::GA' 
E I K spUee donor si teBamHI splice acceptor site 

-ACT GTG OCT OCA 
T V A A 



u 

GAA ATA AA/4 Cv.vCA ACT GTG GCT GCA 
E I K T V A A 

U 

GAA ATA AAA CGA ACT GTG GCT GCA 
E I K R T V A A 



Fig. 35 




Relatne dilution 
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Fig. 36 
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